Abstract



In this paper we study the structure of polynomials of degree three and four that 
have high bias or high Gowers norm, over arbitrary prime fields. In particular we 
obtain the following results. 



1. Let $f$ be a degree three polynomial with $\mathrm{bias}(f) = \delta$. Then there exist $r = O(\log(1/\delta))$ quadratic polynomials $\{q_i\}$, $c = O(\log^4(1/\delta))$ linear functions $\{\ell_i\}$ and a degree three polynomial $g$ such that $f = \sum_{i=1}^{r} \ell_i \cdot q_i + g(\ell_1, \ldots, \ell_c)$. This result generalizes the corresponding result for quadratic polynomials.

2. Let $\deg(f) = 4$ and $\mathrm{bias}(f) = \delta$. Then $f = \sum_{i=1}^{r} \ell_i \cdot g_i + \sum_{i=1}^{r} q_i \cdot q'_i$, where $r = \mathrm{poly}(1/\delta)$, the $\ell_i$-s are linear, the $q_i$-s and $q'_i$-s are quadratic and the $g_i$-s are cubic.

3. Let $\deg(f) = 4$ and $\|f\|_{U^4} = \delta$. Then there exists a partition of a subspace $V \subseteq \mathbb{F}^n$, $\dim(V) \geq n - O(\log(1/\delta))$, to subspaces $\{V_\alpha\}$, such that $\forall \alpha$ $\dim(V_\alpha) \geq n/\exp(\log^2(1/\delta))$ and $\deg(f|_{V_\alpha}) = 3$.

Items 1 and 2 extend and improve previous results for degree three and four polynomials [KL08, GT07]. Item 3 gives a new result for the case of degree four polynomials with high $U^4$ norm. It is the first case where the inverse conjecture for the Gowers norm fails [LMS08, GT07], namely that such an $f$ is not necessarily correlated with a cubic polynomial. Our result shows that instead $f$ equals a cubic polynomial on a large subspace (in fact we show that a much stronger claim holds).

Our techniques are based on finding a structure in the space of partial derivatives of $f$. For example, when $\deg(f) = 4$ and $f$ has high $U^4$ norm we show that there exist quadratic polynomials $\{q_i\}_{i \in [r]}$ and linear functions $\{\ell_j\}_{j \in [R]}$ such that (on a large enough subspace) every partial derivative of $f$ can be written as $\Delta_y(f) = \sum_{j=1}^{R} \ell_j \cdot q_j^{(y)} + \sum_{i=1}^{r} q_i \cdot \ell_i^{(y)} + q_0^{(y)}$, where $q_j^{(y)}, \ell_i^{(y)}$ depend on $y$, the direction of the partial derivative, $r = O(\log^2(1/\|f\|_{U^4}))$ and $R = \exp(r)$.

Research supported by the Israel Science Foundation (grant number 439/06).



1 Introduction 



Assume that we are given a degree $d$ polynomial $f$ that, in some sense, 'behaves' differently from a random degree $d$ polynomial. Is there anything that we can deduce about the structure of $f$ just by knowing this fact? Recently this question has received a lot of attention, where the 'behavior' of $f$ was examined with respect to its bias or the more general notion of the Gowers norm.

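Definition 1.1. Let $f : \mathbb{F}^n \to \mathbb{F}$ be a function. The bias of $f$ is defined as

$$\mathrm{bias}(f) = \left| \mathbb{E}_{a \in \mathbb{F}^n}\left[\omega^{f(a)}\right] \right| ,$$

where $\omega = e^{2\pi i/|\mathbb{F}|}$ is a complex primitive root of unity of order $|\mathbb{F}|$.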
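As a concrete illustration of Definition 1.1, the following minimal Python sketch computes the bias by brute force over all of $\mathbb{F}_p^n$. The function name `bias` and the two example functions are illustrative assumptions and do not appear in the paper; the enumeration takes $p^n$ steps, so it is only meant for tiny instances.

```python
import cmath
from itertools import product

def bias(f, p, n):
    """Brute-force bias of f: F_p^n -> F_p, i.e. |E_a[omega^{f(a)}]|."""
    omega = cmath.exp(2j * cmath.pi / p)  # primitive p-th root of unity
    total = sum(omega ** (f(a) % p) for a in product(range(p), repeat=n))
    return abs(total) / p ** n

# Hypothetical examples over F_3 with n = 2 variables.
linear = lambda a: a[0] + 2 * a[1]   # a non-constant linear form
constant = lambda a: 1               # a constant function

print(round(bias(linear, 3, 2), 6))    # 0.0: non-constant linear forms are unbiased
print(round(bias(constant, 3, 2), 6))  # 1.0: constants are maximally biased
```

The two extremes bracket every other case: any function has bias between 0 and 1.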

Intuitively, the bias of $f$ measures how far the distribution induced by $f$ is from the uniform distribution. We expect a random polynomial to have a vanishingly small bias (as a function of the number of variables), so it is interesting to know what can be said when the bias is not too small. Indeed, Green and Tao [GT07] showed that if $f$ is a degree $d$ polynomial over $\mathbb{F}$, such that $d < |\mathbb{F}|$, and $\mathrm{bias}(f) = \delta$ then $f$ can be written as a function of a small number of lower degree polynomials. Formally, $f(x) = F(g_1, \ldots, g_{c_d})$ for some function $F$ and $c_d = c_d(\mathrm{bias}(f), |\mathbb{F}|)$ polynomials $\{g_i\}$ satisfying $\deg(g_i) < d$. Note that $c_d = c_d(\mathrm{bias}(f), |\mathbb{F}|)$ does not depend on the number of variables, i.e. it is some constant. This result was later extended by Kaufman and Lovett [KL08] to arbitrary finite fields (i.e. without the restriction $d < |\mathbb{F}|$). Thus, if $f$ has a noticeable bias, unlike a random degree $d$ polynomial, then $f$ is in fact very far from being random; simple counting arguments show that most degree $d$ polynomials cannot be represented as functions of a few lower degree polynomials. This result is also interesting as it gives an average case - worst case reduction. Namely, if $f$ has correlation $\delta$ with a lower degree polynomial then it is a function of a small number of lower degree polynomials. One drawback of the results of [GT07, KL08] is the dependence of the number of lower degree polynomials on the bias of $f$. In particular when $\deg(f) = 3$, [GT07, KL08] get the bound $c_3 = \exp(\mathrm{poly}(1/\mathrm{bias}(f)))$ and for $\deg(f) = 4$ they bound$^1$ $c_4$ by a tower of height $c_3$. On the other hand if $\deg(f) = 2$ and $\mathrm{bias}(f) = \delta$ then it is known that $f$ can be written as a function of at most $2\log(1/\delta) + 1$ linear functions. This can be immediately deduced from the following well known theorem.

Theorem 1.1 (Structure of quadratic polynomials). (Theorems 6.21 and 6.30 in [LN97]). For every quadratic polynomial $f : \mathbb{F}^n \to \mathbb{F}$ over a prime field $\mathbb{F}$ there exists an invertible linear transformation $T$, a linear polynomial $\ell$, and field elements $\alpha_1, \ldots, \alpha_n$ (some of which may be 0) such that:

1. If $\mathrm{char}(\mathbb{F}) = 2$ then $(f \circ T)(x) = \sum_{i=1}^{\lfloor n/2 \rfloor} \alpha_i \cdot x_{2i-1} \cdot x_{2i} + \ell(x)$,

2. If $\mathrm{char}(\mathbb{F})$ is odd then $(f \circ T)(x) = \sum_{i=1}^{n} \alpha_i \cdot x_i^2 + \ell(x)$.

Moreover, the number of non zero $\alpha_i$-s is invariant and depends only on $f$.

$^1$ These numbers are not explicitly computed there, but this is what the recursive arguments in the papers imply.

We thus see that there is a sharp contrast between the result for quadratic polynomials and the results for polynomials of degrees as low as three or four. We also note that the results of Kaufman and Lovett only guarantee that $f$ can be represented as $f(x) = F(g_1, \ldots, g_{c_d})$, but no nice structure like the one in Theorem 1.1 is known. It is thus an intriguing question whether a nice structural theorem exists for biased polynomials and what is the correct dependence of the number of lower degree polynomials on $\deg(f)$ and $\mathrm{bias}(f)$.

As mentioned above, a more general measure of randomness that was considered is the so called Gowers norm of $f$. Intuitively, the $U^d$ Gowers norm tests whether $f$ behaves like a degree $d-1$ polynomial on $d$ dimensional subspaces. To define the Gowers norm we first define the notion of a discrete partial derivative.

Definition 1.2. (Discrete partial derivative) For a function $f : \mathbb{F}^n \to \mathbb{F}$ and a direction $y \in \mathbb{F}^n$ we define $\Delta_y(f)(x) = f(x + y) - f(x)$ to be the discrete partial derivative of $f$ in direction $y$ at the point $x$.

It is not difficult to see that if $\deg(f) = d$ then for every $y$, $\deg(\Delta_y(f)) \leq d-1$. We now define the $d$-th Gowers norm of a function $f$.
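The degree-lowering property of the discrete derivative can be checked mechanically on a small case. The sketch below is only an illustration under assumed names (`derivative`, `cubic`): it differentiates a hypothetical cubic over $\mathbb{F}_2^3$ three times and confirms that the result no longer depends on $x$, as expected of a function of degree at most zero.

```python
from itertools import product

def derivative(f, y, p):
    """Return Delta_y(f): x -> f(x + y) - f(x), with arithmetic in F_p."""
    def g(x):
        shifted = tuple((xi + yi) % p for xi, yi in zip(x, y))
        return (f(shifted) - f(x)) % p
    return g

p, n = 2, 3
# Hypothetical degree-3 example, chosen only for illustration.
cubic = lambda x: (x[0] * x[1] * x[2] + x[0] * x[1]) % p
points = list(product(range(p), repeat=n))

# Each derivative lowers the degree by at least one, so three derivatives of a
# cubic leave a constant: the set of values over all x has a single element.
third_is_constant = all(
    len({derivative(derivative(derivative(cubic, y1, p), y2, p), y3, p)(x)
         for x in points}) == 1
    for y1 in points for y2 in points for y3 in points
)
print(third_is_constant)  # True
```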

Definition 1.3 (Gowers norm [AKK+05, Gow98, Gow01]). The $d$-th Gowers norm, $U^d$, of $f$ is defined as

$$\|f\|_{U^d} = \left| \mathbb{E}_{x, y_1, \ldots, y_d}\left[\omega^{\Delta_{y_1} \cdots \Delta_{y_d}(f)(x)}\right] \right|^{1/2^d} ,$$

where again $\omega = e^{2\pi i/|\mathbb{F}|}$.

Note that $\|f\|_{U^0} = \|f\|_{U^1} = \mathrm{bias}(f)$. It is also clear that if $\deg(f) = d-1$ then $\|f\|_{U^d} = 1$. For more properties of the Gowers norm we refer the reader to [Gow98, Gow01, GT08, Sam07, VW07].
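The two facts just noted can be confirmed numerically on a toy example. The following sketch is a brute-force evaluation written for illustration only (the name `gowers_norm` and the example $f = x_0 x_1$ over $\mathbb{F}_2^2$ are assumptions); it expands the iterated derivative as an alternating sum over subsets of directions, so its cost grows like $p^{n(d+1)} 2^d$.

```python
import cmath
from itertools import product

def gowers_norm(f, p, n, d):
    """Brute-force U^d norm of f: F_p^n -> F_p."""
    omega = cmath.exp(2j * cmath.pi / p)
    points = list(product(range(p), repeat=n))
    total = 0
    for x in points:
        for ys in product(points, repeat=d):
            # Delta_{y_1}...Delta_{y_d} f(x) = sum over subsets S of the
            # directions of (-1)^(d-|S|) * f(x + sum_{i in S} y_i).
            val = 0
            for mask in product((0, 1), repeat=d):
                shift = list(x)
                for use, y in zip(mask, ys):
                    if use:
                        shift = [(s + yi) % p for s, yi in zip(shift, y)]
                val += (-1) ** (d - sum(mask)) * f(tuple(shift))
            total += omega ** (val % p)
    return abs(total / len(points) ** (d + 1)) ** (1 / 2 ** d)

f = lambda x: x[0] * x[1]                 # hypothetical quadratic over F_2^2
print(round(gowers_norm(f, 2, 2, 0), 6))  # 0.5, the bias of f
print(round(gowers_norm(f, 2, 2, 1), 6))  # 0.5, again the bias of f
print(round(gowers_norm(f, 2, 2, 3), 6))  # 1.0, since deg(f) = 2 = d - 1
```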

In [AKK+05] Alon et al. showed that if $\|f\|_{U^d} > 1 - \delta$, for some small $\delta$, then $f$ can be well approximated by a degree $d-1$ polynomial. This raises the question whether any function that has a noticeable $U^d$ norm is somewhat correlated with a lower degree polynomial, and indeed in [Sam07, GT08] this was conjectured to be the case. This conjecture has become known as the inverse conjecture for the Gowers norm. Samorodnitsky [Sam07] proved that if $\|f\|_{U^3} = \delta$ where $f : \mathbb{F}_2^n \to \mathbb{F}_2$ is an arbitrary function, then $f$ has an exponentially high (in $\delta$) correlation with a quadratic polynomial. Namely, there exists a quadratic polynomial $q$ such that $\Pr_{x \in \mathbb{F}_2^n}[f(x) = q(x)] \geq 1/2 + \exp(-\mathrm{poly}(1/\delta))$. Green and Tao [GT08] obtained a similar result for fields of odd characteristic. These results gave an affirmative answer for the case of the $U^3$ norm. More generally, Green and Tao proved that if $d < |\mathbb{F}|$ and $f$ is a degree $d$ polynomial with a high $U^d$ norm then $f$ is indeed correlated with a lower degree polynomial [GT07]. Recently, the case of large characteristic was solved by Tao and Ziegler [TZ08].$^2$ Using ideas from ergodic theory and the earlier [BTZ] they proved that if $f : \mathbb{F}^n \to \mathcal{D}$ (where $\mathcal{D}$ is the unit disk in $\mathbb{C}$) is a function with high $U^d$ norm and $d < |\mathbb{F}|$ then $f$ is correlated with a degree $d-1$ phase polynomial.$^3$ This completely settled the conjecture for the case $d < |\mathbb{F}|$. On the other hand, for the $U^4$ norm it was



$^2$ In fact, [TZ08] only get a qualitative result. No explicit connection is known between the Gowers norm and the correlation with polynomials.

$^3$ A degree $d-1$ phase polynomial is a function of the form $e^{2\pi i \theta} \omega^{g}$, for some degree $d - 1 - (p-1)t$ polynomial $g$ where $\theta \in [0, 1]$ and $\omega = e^{2\pi i/|\mathbb{F}|}$.

shown, independently, by Lovett, Meshulam and Samorodnitsky [LMS08] and by Green and Tao [GT07] that no such result is possible when $\mathbb{F} = \mathbb{F}_2$. Namely, [LMS08] proved that the symmetric polynomial $S_4(x_1, \ldots, x_n) = \sum_{T \subseteq [n], |T| = 4} \prod_{i \in T} x_i$, which is of degree four, has a high $U^4$ norm but has an exponentially (in $n$) small correlation with any lower degree polynomial. Similar examples were given for other fields (when $d$ is large enough compared to the size of the field). These examples show that for small fields the inverse conjecture for the Gowers norm is not true in its current form. In their work, Tao and Ziegler [TZ08] proved a variant of the conjecture for the case $d \geq |\mathbb{F}|$. Namely, that if a function $f$ has high $U^d$ norm then $f$ is correlated with a phase polynomial of a certain constant degree (but not necessarily smaller than $d$). We note however, that if $\deg(f) = d$ then the results of [TZ08] do not give any information on $f$. In fact, even if $\deg(f) = 4$ and $f$ has a high $U^4$ norm then nothing is known on the structure of $f$. It is thus a very interesting question to understand the structure of low degree polynomials having high Gowers norm over small fields.

Besides being natural questions in their own right, results on the Gowers norms have had many applications in mathematics and computer science. In his seminal work on finding arithmetic progressions in dense sets, Gowers first defined the $U^d$ norm (for functions from $\mathbb{Z}_n$ to $\mathbb{Z}_n$) and proved an inverse theorem for it that was instrumental in his proofs [Gow98, Gow01]. Bogdanov and Viola's [BV07] attempt at constructing a pseudorandom generator for constant degree polynomials relied on the (erroneous) inverse conjecture for the Gowers norm, yet it paved the way for other papers solving the problem [Lov08, Vio08]. In [ST06] applications of an inverse theorem for the Gowers norm to PCP constructions were given. Samorodnitsky's proof of the inverse theorem for the $U^3$ norm [Sam07] implies a low degree test distinguishing quadratic functions from those that do not have a non trivial correlation with a quadratic function. This result also gives a test for checking the distance of a given word from the 2nd order Reed-Muller code, beyond the list decoding radius. For a more elaborate discussion of the connection between additive combinatorics and computer science see [Tre09].

1.1 Our results 

In this work we are able to show analogs of Theorem 1.1 for polynomials of degree three and four. We also prove a structural result for the case that such a polynomial has a high Gowers norm. Our first main result is the following.

Theorem 1. (biased cubic polynomials) Let $\mathbb{F}$ be a finite field and $f \in \mathbb{F}[x_1, \ldots, x_n]$ a cubic polynomial ($\deg(f) = 3$) such that $\mathrm{bias}(f) = \delta$. Then there exist $c_1 = O(\log(1/\delta))$ quadratic polynomials $q_1, \ldots, q_{c_1} \in \mathbb{F}[x_1, \ldots, x_n]$ and linear functions $\ell_1, \ldots, \ell_{c_1} \in \mathbb{F}[x_1, \ldots, x_n]$ and another $c_2 = O(\log^4(1/\delta))$ linear functions $\ell'_1, \ldots, \ell'_{c_2} \in \mathbb{F}[x_1, \ldots, x_n]$ such that $f = \sum_{j=1}^{c_1} \ell_j \cdot q_j + g(\ell'_1, \ldots, \ell'_{c_2})$, where $g$ is cubic.

We note that if it weren't for the $g(\ell'_1, \ldots, \ell'_{c_2})$ part then this result would be quantitatively the same as Theorem 1.1 (and tight of course). It is an interesting open question to decide whether we can do only with the $\sum_{j=1}^{c_1} \ell_j \cdot q_j$ part. Using the same techniques we show a similar result for the case that $\|f\|_{U^3} \geq \delta$.

Theorem 2. (cubic polynomials with high $U^3$ norm) Let $\mathbb{F}$ be a finite field and $f \in \mathbb{F}[x_1, \ldots, x_n]$ a cubic polynomial such that $\|f\|_{U^3} = \delta$. Then there exist $c + 1 = O(\log^2(1/\delta))$ quadratic polynomials $q_0, \ldots, q_c \in \mathbb{F}[x_1, \ldots, x_n]$ and $c$ linear functions $\ell_1, \ldots, \ell_c \in \mathbb{F}[x_1, \ldots, x_n]$ such that $f = \sum_{j=1}^{c} \ell_j \cdot q_j + q_0$.

Note that the difference between the structure of $f$ in Theorems 1 and 2 is the number of quadratic functions required. Recall that in [Sam07] Samorodnitsky proved that if an $\mathbb{F}_2$ function $f$ has a high $U^3$ norm then it has an exponentially (in $\|f\|_{U^3}$) high correlation with a quadratic polynomial. Thus, our theorem shows that when $f$ is a cubic polynomial then a much stronger statement holds. Namely, $f$ has correlation $\exp(-\log^2(1/\delta))$ with a quadratic polynomial, and further, has a nice structure.

Our second main result is an analog of Theorem 1 for the case of quartic polynomials (i.e. $\deg(f) = 4$).

Theorem 3. (biased quartic polynomials) Let $\mathbb{F}$ be a finite field and $f \in \mathbb{F}[x_1, \ldots, x_n]$ a quartic polynomial ($\deg(f) = 4$) such that $\mathrm{bias}(f) = \delta$. Then there exist $4c = \mathrm{poly}(|\mathbb{F}|/\delta)$ polynomials $\{\ell_i, q_i, q'_i, g_i\}_{i=1}^{c}$, where the $\ell_i$-s are linear, the $q_i$-s and $q'_i$-s are quadratic and the $g_i$-s are cubic, such that $f = \sum_{j=1}^{c} \ell_j \cdot g_j + \sum_{j=1}^{c} q_j \cdot q'_j$.

As mentioned above, prior to this result it was known that there exist $C$ cubic polynomials $g_1, \ldots, g_C$ and a function $F$ such that $f = F(g_1, \ldots, g_C)$, where $C$ is a tower of height $\exp(\mathrm{poly}(1/\delta))$ [GT07, KL08]. Thus, our result greatly improves the dependence on $\delta$ and gives a nice structure for the polynomial. We note that in their work Green and Tao do show that such a nice structure exists when $d < |\mathbb{F}|$ [GT07], but no such result was known for smaller fields (in addition $C$ needs to be even larger for such a nice representation to hold).

Our third main result is for the case where $\deg(f) = 4$ and $\|f\|_{U^4} = \delta$. In such a case it is known [LMS08, GT07] that we cannot hope to get a nice structure as in Theorem 2, as it may be the case that $f$ has an exponentially small (in $n$) correlation with all lower degree polynomials. However, we do manage to show that there is some subspace $U \subseteq \mathbb{F}^n$ such that the restriction $f|_U$ is equal to some degree three polynomial. Thus, $f$ does not have a correlation with a cubic polynomial in the entire space but instead there is a large subspace on which it is of degree three. In fact we show a more general result. Namely, that there is a large subspace $V$, of dimension $n - O(\log(1/\delta))$, that can be partitioned to subspaces of dimension $n/\exp(\log^2(1/\delta))$ such that the restriction of $f$ to any of the subspaces in the partition is of degree three.

Theorem 4. (quartic polynomials with high $U^4$ norm) Let $\mathbb{F}$ be a finite field and $f \in \mathbb{F}[x_1, \ldots, x_n]$ a degree four polynomial such that $\|f\|_{U^4} = \delta$. Then there exists a partition of a subspace $V \subseteq \mathbb{F}^n$, of dimension $\dim(V) \geq n - O(\log(1/\delta))$, to subspaces $\{V_\alpha\}_{\alpha \in I}$, satisfying $\dim(V_\alpha) = \Omega(n/|\mathbb{F}|^{\log^2(1/\delta)})$, such that for every $\alpha \in I$, $f|_{V_\alpha}$ is a cubic polynomial.

Remark 1.4. Note that the structure guaranteed in Theorem 4 is shared by very few polynomials. Specifically, a random polynomial of degree four is unlikely to be equal to any degree three polynomial on any subspace of dimension larger than, say, $n^{0.9}$. To see this note that if $|\mathbb{F}| = p$ and $\dim(V) = d$ then there are roughly $p^{d^3}$ cubic polynomials and $p^{d^4}$ quartic polynomials over $V$. Furthermore, the map taking a quartic polynomial over $\mathbb{F}^n$ to its restriction to $V$ is a linear map and so the fraction of quartic polynomials that equal a degree three polynomial on $V$ is (roughly) $p^{-d^4 + d^3}$. As the total number of subspaces can be bounded by $p^{n^2}$ we get that the fraction of quartic polynomials that are equal to a degree three polynomial on some subspace of dimension greater than $n^{0.9}$ is at most $p^{n^2 - n^{3.6} + n^{2.7}} = \exp(-n^{3.6})$.

This result has the same flavor as the inverse $U^3$ norm theorem of [GT08]. There it was shown that if $f : \mathbb{F}_5^n \to \mathbb{F}_5$ satisfies $\|f\|_{U^3} = \delta$ then there exists a subspace $V$ of codimension $\mathrm{poly}(1/\delta)$, such that on an 'average' coset of $V$, $f$ is correlated with a quadratic polynomial. Recently, Wolf [Wol09] proved a similar result for the case of characteristic two, thus extending Samorodnitsky's argument [Sam07]. The main difference between these results and our result is that ours only holds for polynomials of degree four whereas the results of [GT08, Sam07, Wol09] hold for arbitrary functions. On the other hand our result holds for the $U^4$ norm compared to the $U^3$ norm studied there. Moreover, when $\mathrm{char}(\mathbb{F}) > 4$, using the same techniques we can actually show that $f$ must have a structure similar to the one guaranteed by Theorem 3.

Theorem 5. Let $\mathbb{F}$ be a finite field with $\mathrm{char}(\mathbb{F}) > 4$ and $f \in \mathbb{F}[x_1, \ldots, x_n]$ a degree four polynomial such that $\|f\|_{U^4} = \delta$. Then

$$f = \sum_{i=1}^{R} \ell_i \cdot g_i + \sum_{i=1}^{r} q_i \cdot q'_i ,$$

for $r = O(\log^2(1/\delta))$ and $R = \exp(\log^2(1/\delta))$ where $\ell_i$ is linear, $q_i, q'_i$ are quadratic and $g_i$ cubic.

1.2 Proof Technique 

The main approach in all the proofs is to consider the space of discrete partial derivatives of $f$ and look for some structure there. We will explain the idea for the case of degree three polynomials and then its extension to degree four polynomials.

Let $f$ be a degree three polynomial. Assume that $f$ has high bias (alternatively, high $U^3$ norm). By a standard argument it follows that a constant fraction of its derivatives, which are degree 2 polynomials, have high bias (high $U^2$ norm). By Theorem 1.1 it follows that for a constant fraction of the directions, the partial derivatives depend on a small number of linear functions (same for the $U^3$ norm). Hence, in the space of partial derivatives, a constant fraction of the elements depend on a few linear functions. We now show that there must be a small number of linear functions that 'explain' this. More accurately, we show that there exist a subspace $V \subseteq \mathbb{F}^n$, of dimension $\dim(V) = n - O(1)$, and $O(1)$ linear functions $\ell_1, \ldots, \ell_c$, such that for every $y \in V$ it holds that $\Delta_y(f) = \sum_{i=1}^{c} \ell_i \cdot \ell_i^{(y)} + \ell_0^{(y)}$, where the $\ell_i^{(y)}$ are linear functions determined by $y$.

We are now basically done. Consider the subspace $U = \{x : \ell_1(x) = \ldots = \ell_c(x) = 0\}$. Then, for every $y \in V$ it holds that $\Delta_y(f)|_U = \ell_0^{(y)}|_U$. This implies that $f|_V = \sum_{i=1}^{c} \ell_i \cdot q_i + q_0$, where the $q_i$-s are quadratic polynomials. As $\dim(V) = n - O(1)$ we obtain the same structure (with a different constant $c$) for $f$.

To prove the result for biased degree four polynomials we follow the footsteps of [KL08] with two notable differences. Let $f$ be such a polynomial. First, we pass to a subspace on which all the partial derivatives of $f$ have low rank as degree three polynomials. This step relies on our results for biased degree three polynomials. Then, as in [KL08], we show that $f$ can be approximated by a function of a few of its derivatives. Because of the properties of the derivatives, this means that $f$ can be approximated well by a function of a few quadratics and linear functions. We then show, again following [KL08], that in such a case $f$ is actually a function of a few quadratics and linear functions. Here we heavily rely on properties of quadratic functions to avoid the blow up in the number of polynomials approximating $f$ that occurs in [KL08, GT07]. Finally, we show that if a degree four polynomial is a function of several quadratic and linear functions then it actually has a nice structure.

The proof for the case of degree four polynomials with high $U^4$ norm is more delicate. Assume that $f$ is such a polynomial. As before, a constant fraction of the partial derivatives of $f$ are degree three polynomials with high $U^3$ norm. By the result for degree three polynomials we get that each of those partial derivatives is of the form $\Delta_y(f) = \sum_{i=1}^{c} \ell_i^{(y)} \cdot q_i^{(y)} + q_0^{(y)}$. Again we find a subspace $V$, of constant co-dimension, such that for every $y \in V$, $\Delta_y(f)$ has a nice structure. We now show that there exist a small number of linear and quadratic functions $\{\ell_i, q_i\}_{i=1}^{c}$ such that for every $y \in V$ it holds that $\Delta_y(f) = \sum_{i=1}^{c} \ell_i \cdot q_i^{(y)} + \sum_{i=1}^{c} q_i \cdot \ell_i^{(y)} + q_0^{(y)}$, where the polynomials $\{\ell_i^{(y)}, q_i^{(y)}\}$ depend on $y$. This is the technical heart of the proof. It now follows quite easily that there is a subspace $U \subseteq V$ of dimension $n/\exp(c)$ such that when restricted to $U$ all the functions $\{\ell_i, q_i\}$ are fixed to constants. Thus, for every $y \in U$ it holds that $\deg(\Delta_y(f)) = 2$. So we get that $\deg(f|_U) = 3$. In fact, by closely examining the argument above we show an even stronger result. Namely, that we can partition a large subspace of $\mathbb{F}^n$ to (affine) subspaces of dimension $n/\exp(c)$ such that on each of the subspaces $f$ is equal to some cubic polynomial (that may depend on the subspace).

1.3 Organization 

In Section 2 we give some basic definitions and discuss properties of subadditive functions. In Section 3 we prove the theorems concerning degree three polynomials. In Section 4 we prove Theorem 3, and in the section after it we prove Theorems 4 and 5.

2 Preliminaries 

In this paper $\mathbb{F}$ will always be a prime field. We denote with $\mathbb{F}_p$ the field with $p$ elements. As we will be considering functions over $\mathbb{F}_p$ we will work modulo the polynomials $x_i^p - x_i$. In particular, when we write $f = g$, for two polynomials, we mean that they are equal as functions and not just as formal expressions. This will be mainly relevant when we consider quadratic polynomials (or higher degree polynomials) over $\mathbb{F}_2$. More generally, we shall say that a function $f$ has degree $d$ if there is a degree $d$ polynomial $g$ such that $f = g$. Note that this does not have an effect on the bias and the Gowers norm. Namely, the bias and $U^d$ norm of $f$ do not change when adding multiples of $x_i^p - x_i$. Finally we note that if all the partial derivatives of $f$ have degree at most $d-1$ then there is a polynomial $g$ of degree at most $d$ such that $f = g$ (this is easily proved by observing that a degree $k$ polynomial, all of whose individual degrees are smaller than $|\mathbb{F}|$, always has a partial derivative whose degree is $k-1$). From this point on we shall use the notions of a function and a polynomial interchangeably, without any real distinction.



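The Fourier transform of a function $f : \mathbb{F}^n \to \mathbb{F}$ is defined as

$$\hat{f}(\alpha) = \mathbb{E}_{x \in \mathbb{F}^n}\left[f(x) \chi_\alpha(x)\right] ,$$

where for $\alpha = (\alpha_1, \ldots, \alpha_n)$, $\chi_\alpha(x) = \omega^{\sum_{i=1}^{n} \alpha_i x_i}$ where $\omega = e^{2\pi i/|\mathbb{F}|}$ is a complex primitive root of unity of order $|\mathbb{F}|$. For more on Fourier transform see [Ste03].

We say that a function $h$ $\epsilon$-approximates a function $f$ if $\Pr_x[f(x) \neq h(x)] \leq \epsilon$.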

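Definition 2.1. Following [KL08] we say that the distribution induced by a set of functions $\{h_i\}_{i=1}^{m}$ (all from $\mathbb{F}^n$ to $\mathbb{F}$) is $\gamma$ close to the uniform distribution if for every $a_1, \ldots, a_m \in \mathbb{F}$ it holds that

$$\left| \Pr_{x \in \mathbb{F}^n}\left[\forall\, 1 \leq i \leq m,\ h_i(x) = a_i\right] - |\mathbb{F}|^{-m} \right| \leq \gamma |\mathbb{F}|^{-m} .$$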



The following well known lemma bounds the distance between distributions using the 
Fourier transform. 

Lemma 2.2. For $i = 1 \ldots m$ let $h_i : \mathbb{F}^n \to \mathbb{F}$ be a function. Then, the distribution induced by the $h_i$-s is $\gamma$ close to uniform if for every nontrivial linear combination $h_\alpha = \sum_{i=1}^{m} \alpha_i h_i$ we have that $\mathrm{bias}(h_\alpha) \leq \gamma / |\mathbb{F}|^{3m/2}$.

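Proof. Let $H : \mathbb{F}^n \to \mathbb{F}^m$ be defined as $H(x) = (h_1(x), \ldots, h_m(x))$. For $y \in \mathbb{F}^m$ let $f(y) = \Pr_{x \in \mathbb{F}^n}[H(x) = y]$. We have that

$$|\hat{f}(\alpha)| = \left| \mathbb{E}_{y \in \mathbb{F}^m}\left[f(y)\chi_\alpha(y)\right] \right| = |\mathbb{F}|^{-m} \left| \mathbb{E}_{x \in \mathbb{F}^n}\left[\chi_\alpha(H(x))\right] \right| = |\mathbb{F}|^{-m}\, \mathrm{bias}\left(\sum_{i=1}^{m} \alpha_i h_i\right) .$$

Therefore,

$$\left(\sum_{y \in \mathbb{F}^m} \left|f(y) - |\mathbb{F}|^{-m}\right|\right)^2 \leq |\mathbb{F}|^{m} \sum_{y \in \mathbb{F}^m} \left|f(y) - |\mathbb{F}|^{-m}\right|^2$$

$$= |\mathbb{F}|^{m} \sum_{y \in \mathbb{F}^m} \left(f(y)^2 - 2|\mathbb{F}|^{-m} f(y) + |\mathbb{F}|^{-2m}\right)$$

$$= \left(\sum_{y} |\mathbb{F}|^{m} f(y)^2\right) - 1 = \left(\sum_{\alpha} |\mathbb{F}|^{2m} |\hat{f}(\alpha)|^2\right) - 1$$

$$= \sum_{\alpha \neq 0} \mathrm{bias}\left(\sum_{i=1}^{m} \alpha_i h_i\right)^2 \leq |\mathbb{F}|^{-2m} \gamma^2 .$$

Hence, for every $y \in \mathbb{F}^m$ it holds that $\left|f(y) - |\mathbb{F}|^{-m}\right| \leq |\mathbb{F}|^{-m} \gamma$, which is what we wanted to prove. □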
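The lemma can be sanity-checked numerically on a small instance. In the sketch below, which is an illustration and not part of the paper's argument, `uniformity_gap` measures the smallest $\gamma$ for which the induced distribution is $\gamma$ close to uniform, and `fourier_gamma` is the value $|\mathbb{F}|^{3m/2}$ times the largest bias of a nontrivial combination, i.e. the $\gamma$ that the lemma certifies. All function names and the example polynomial are assumptions made for the illustration.

```python
import cmath
from itertools import product

def bias(f, p, n):
    omega = cmath.exp(2j * cmath.pi / p)
    return abs(sum(omega ** (f(x) % p) for x in product(range(p), repeat=n))) / p ** n

def uniformity_gap(hs, p, n):
    """max_a |Pr_x[H(x) = a] - p^-m| * p^m, the smallest valid gamma."""
    m = len(hs)
    counts = {}
    for x in product(range(p), repeat=n):
        key = tuple(h(x) % p for h in hs)
        counts[key] = counts.get(key, 0) + 1
    return max(abs(counts.get(a, 0) / p ** n - p ** -m) * p ** m
               for a in product(range(p), repeat=m))

def fourier_gamma(hs, p, n):
    """|F|^(3m/2) times the largest bias of a nontrivial linear combination."""
    m = len(hs)
    def combo(alpha):
        return lambda x: sum(a * h(x) for a, h in zip(alpha, hs))
    worst = max(bias(combo(alpha), p, n)
                for alpha in product(range(p), repeat=m) if any(alpha))
    return worst * p ** (1.5 * m)

p, n = 3, 4
hs = [lambda x: x[0] * x[1] + x[2] * x[3]]  # hypothetical example with m = 1
print(uniformity_gap(hs, p, n) <= fourier_gamma(hs, p, n))  # True
```

In this example the true gap is $2/9$ while the Fourier bound gives $\sqrt{27}/9$, so the lemma's guarantee holds with some slack.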
2.1 Subadditive functions



As described in Section 1.2, our proofs are based on finding a structure for the space of partial derivatives of the underlying polynomial $f$. To this end we need a special case of the Bogolyubov-Chang lemma (see e.g. [Gre]).
For a set $A \subseteq \mathbb{F}^n$ denote with $kA - kA$ the set

$$kA - kA = \{a_1 + \ldots + a_k - a_{k+1} - \ldots - a_{2k} \mid \forall i\ a_i \in A\} .$$
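To make the set $kA - kA$ concrete, here is a small Python sketch that enumerates it for a set of vectors over $\mathbb{F}_p$. The helper name `k_sum_minus_k_sum` and the three-element example set are assumptions chosen for illustration; the example shows the set growing with $k$ until it fills the whole space.

```python
from itertools import product

def k_sum_minus_k_sum(A, k, p):
    """Return kA - kA = {a_1+...+a_k - a_{k+1}-...-a_{2k} : a_i in A} in F_p^n."""
    n = len(next(iter(A)))
    result = set()
    for plus in product(A, repeat=k):
        for minus in product(A, repeat=k):
            result.add(tuple(
                (sum(v[i] for v in plus) - sum(v[i] for v in minus)) % p
                for i in range(n)))
    return result

A = {(0, 0), (1, 0), (0, 1)}             # hypothetical subset of F_3^2
print(len(k_sum_minus_k_sum(A, 1, 3)))   # 7: A - A misses (1,1) and (2,2)
print(len(k_sum_minus_k_sum(A, 2, 3)))   # 9: 2A - 2A is all of F_3^2
```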

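Lemma 2.3 (Bogolyubov-Chang). Let $A \subseteq U$ be a subset of a linear space $U$ such that $|A| = \mu_0 \cdot |U|$. Then, for some $k \leq \max\left(1, \left\lceil \frac{1}{2}\left(\log_{\frac{|\mathbb{F}|}{|\mathbb{F}|-1/2}}(2/\mu_0) + 2\right) \right\rceil\right)$, $kA - kA$ contains a subspace $W$ of co-dimension at most $\log_{\frac{|\mathbb{F}|-1/2}{|\mathbb{F}|-1}}(1/2\mu_0)$.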

For completeness we give the proof here. 

Proof. For $\mu \in (0, 1)$ define $\rho(\mu) = \frac{|\mathbb{F}|-1/2}{|\mathbb{F}|} \cdot \mu$. We shall think of $A$ also as the characteristic function of the set $A$ and denote with $\{\hat{A}(\alpha)\}$ its Fourier coefficients. Assume that there is some $\alpha \neq 0$ such that $|\hat{A}(\alpha)| \geq \rho(\mu_0)$. This means that there is some (affine) subspace $W$ of co-dimension at most one such that

$$|A \cap W| / |W| \geq \rho(\mu_0) \cdot |\mathbb{F}| / (|\mathbb{F}| - 1) = \frac{|\mathbb{F}|-1/2}{|\mathbb{F}|-1} \cdot \mu_0 = (1 + \epsilon)\mu_0 ,$$

where $\epsilon = \frac{1}{2|\mathbb{F}|-2}$. In other words, the density of $A$ on $W$ is $(1 + \epsilon)$ times larger than its density over the entire space. We continue restricting $A$ to co-dimension one subspaces (updating $\mu$ and considering $\rho(\mu)$ at each step) until after at most $t = \log_{\frac{|\mathbb{F}|-1/2}{|\mathbb{F}|-1}}(1/2\mu_0)$ steps we reach one of two possibilities. Either we get a subspace $V \subseteq U$ of co-dimension at most $t$ such that $|A \cap V| > |V|/2$, or $|\widehat{A \cap V}(\alpha)| < \rho(\mu)$ for every $\alpha \neq 0$, where $\mu_0 \leq \mu = |A \cap V|/|V|$. In the first case it is clear that $(A \cap V) + (A \cap V) = V$ and so we found a subspace $V$ of co-dimension at most $t$ contained in $A + A$. In the second case, where all the non-zero Fourier coefficients are smaller than $\rho(\mu)$, we show that for $k = \left\lceil \frac{1}{2}\left(\log_{\frac{|\mathbb{F}|}{|\mathbb{F}|-1/2}}(2/\mu) + 2\right) \right\rceil$ it holds that $k(A \cap V) - k(A \cap V) = V$. To this end we follow the proof of Lemma 4.4 in [Gre]. Let $B = A \cap V$. For $x \in V$ denote with $r_k(x)$ the number of representations of $x$ as $a_1 + \ldots + a_k - a'_1 - \ldots - a'_k$ where the $a_i$-s and $a'_i$-s are from $B$. Clearly, $r_k(x)$ is equal to the sum, over all $(y_1, \ldots, y_k, z_1, \ldots, z_{k-1}) \in B^{2k-1}$, of $A(y_1) \cdot A(y_2) \cdots A(y_k) \cdot A(z_1) \cdots A(z_{k-1}) \cdot A(y_1 + \ldots + y_k - z_1 - \ldots - z_{k-1} - x)$. Writing the Fourier expansion of $A$ and using routine calculations we conclude that

$$r_k(x) = |\mathbb{F}|^{(2k-1)n} \sum_{\alpha} |\hat{B}(\alpha)|^{2k} \chi_\alpha(x) \geq |\mathbb{F}|^{(2k-1)n} \left( \hat{B}(0)^{2k} - \sum_{\alpha \neq 0} |\hat{B}(\alpha)|^{2k} \right)$$

$$\geq |\mathbb{F}|^{(2k-1)n} \left( \hat{B}(0)^{2k} - \rho(\mu)^{2k-2} \sum_{\alpha} |\hat{B}(\alpha)|^{2} \right) = |\mathbb{F}|^{(2k-1)n} \left( \mu^{2k} - \rho(\mu)^{2k-2} \mu \right) > 0 ,$$

where the last inequality follows from the choice of $k$ (we also used the fact that $A$ is a 0/1 function). In particular, $V \subseteq kA - kA$ as needed. □

We will mainly apply the lemma on sets $A \subseteq \mathbb{F}^n$ containing all directions where the partial derivatives of our underlying polynomial $f$ are either very biased or have a high Gowers norm. More generally we define the notion of a subadditive function below.

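Definition 2.4. Let $V \subseteq \mathbb{F}^n$ be a linear space. $\mathcal{F} : V \to \mathbb{R}^+$ is a subadditive function if for every $u, v \in V$ and $\alpha \in \mathbb{F}$ it holds that $\mathcal{F}(\alpha \cdot u + v) \leq \mathcal{F}(u) + \mathcal{F}(v)$.

Lemma 2.5. Let $\mathcal{F} : U \to \mathbb{R}^+$ be a subadditive function. Define $A_r = \{x \in U \mid \mathcal{F}(x) \leq r\}$. If $|A_r| \geq \mu|U|$, then there exists a vector space $V$ of co-dimension at most $\log_{\frac{|\mathbb{F}|-1/2}{|\mathbb{F}|-1}}(1/2\mu) = O(\log(1/\mu))$ such that for every $y \in V$ it holds that $\mathcal{F}(y) \leq 2r \cdot \left\lceil \frac{1}{2}\left(\log_{\frac{|\mathbb{F}|}{|\mathbb{F}|-1/2}}(2/\mu) + 2\right) \right\rceil + 2r = O(r \log(1/\mu))$.

Proof. The proof is immediate from Lemma 2.3. Let $V$ be the subspace guaranteed by the lemma when applied on $A_r$. As $V \subseteq kA_r - kA_r$, for $k \leq \max\left(1, \left\lceil \frac{1}{2}\left(\log_{\frac{|\mathbb{F}|}{|\mathbb{F}|-1/2}}(2/\mu) + 2\right) \right\rceil\right)$, we get that $\mathcal{F}(y) \leq 2kr$ for every $y \in V$. □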

A typical example of a subadditive function will be the rank of a quadratic polynomial. 

Definition 2.6. Let $q$ be a degree two function over a prime field $\mathbb{F}$. We define $\mathrm{rank}_2(q) = r$, where $r$ is the number of $\alpha_i$-s that are non zero when considering the canonical representation of $q$ in Theorem 1.1.

The following lemma is immediate. 

Lemma 2.7. For two quadratic polynomials $q, q'$ and a constant $\alpha \in \mathbb{F}$ we have that $\mathrm{rank}_2(q + \alpha q') \leq \mathrm{rank}_2(q) + \mathrm{rank}_2(q')$.

A more interesting example is given in the following lemma. 

Lemma 2.8. Let $f$ be a cubic polynomial over a prime field $\mathbb{F}$. For every $y \in \mathbb{F}^n$ define $\mathcal{F}(y) = \mathrm{rank}_2(\Delta_y(f))$. Then $\mathcal{F}$ is a subadditive function.

Proof. The proof follows from the following simple observation

$$\Delta_y(f) + \Delta_z(f) = f(x+y) - f(x) + f(x+z) - f(x)$$

$$= f(x+y+z) - f(x) - \left(f(x+y+z) - f(x+y) - (f(x+z) - f(x))\right)$$

$$= \Delta_{y+z}(f)(x) - \left(\Delta_z(f)(x+y) - \Delta_z(f)(x)\right)$$

$$= \Delta_{y+z}(f)(x) - \Delta_y \Delta_z(f)(x) .$$

Indeed, we now get that $\mathcal{F}(y+z) = \mathrm{rank}_2(\Delta_{y+z}(f)) = \mathrm{rank}_2(\Delta_y(f) + \Delta_z(f) + \Delta_y \Delta_z(f)(x)) = \mathrm{rank}_2(\Delta_y(f) + \Delta_z(f)) \leq \mathrm{rank}_2(\Delta_y(f)) + \mathrm{rank}_2(\Delta_z(f)) = \mathcal{F}(y) + \mathcal{F}(z)$, where we used the fact that adding a linear function to a quadratic polynomial does not change its rank. □
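The identity at the core of this proof holds pointwise for any function, and it is easy to confirm by exhaustive enumeration. The sketch below is an illustrative check only; the helper names `add` and `delta` and the sample cubic over $\mathbb{F}_3^2$ are assumptions.

```python
from itertools import product

P = 3  # field size used in this illustration

def add(x, y):
    return tuple((a + b) % P for a, b in zip(x, y))

def delta(f, y):
    """Discrete derivative Delta_y(f): x -> f(x + y) - f(x) over F_P."""
    return lambda x: (f(add(x, y)) - f(x)) % P

# Hypothetical cubic polynomial in two variables over F_3.
f = lambda x: (x[0] ** 2 * x[1] + x[0] * x[1] + 2 * x[1]) % P
points = list(product(range(P), repeat=2))

# Check Delta_y f + Delta_z f = Delta_{y+z} f - Delta_y Delta_z f at every x, y, z.
identity_holds = all(
    (delta(f, y)(x) + delta(f, z)(x)) % P
    == (delta(f, add(y, z))(x) - delta(delta(f, z), y)(x)) % P
    for x in points for y in points for z in points
)
print(identity_holds)  # True
```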

3 The structure of cubic polynomials 

In this section we prove Theorems 1 and 2. As described in Section 1.2, both proofs are based on finding a structure for the space of partial derivatives of $f$.

3.1 Restricting the polynomial to a 'good' subspace

In this section we show that if a cubic $f$ is biased or has a large $U^3$ norm then there is a subspace $V \subseteq \mathbb{F}^n$ such that for every $y \in V$ the rank of $\Delta_y(f)$ is relatively small. We start by showing that if $f$ is biased or has a high Gowers norm then so do many of its partial derivatives. The following lemmas are well known and we prove them here for completeness.

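Lemma 3.1. Let $f : \mathbb{F}^n \to \mathbb{F}_p$ be such that $\mathrm{bias}(f) = \delta$. Then a fraction of at least $\frac{1}{2}\delta^2$ of the partial derivatives $\Delta_y(f)$ satisfy $\mathrm{bias}(\Delta_y(f)) \geq \frac{1}{2}\delta^2$.

Proof. We first compute the expected bias of a partial derivative with respect to a random direction.

$$\mathbb{E}_{y \in \mathbb{F}^n}\left[\mathrm{bias}(\Delta_y(f))\right] = \mathbb{E}_{y}\left| \mathbb{E}_{x}\left[\omega^{f(x+y) - f(x)}\right] \right| \geq \left| \mathbb{E}_{x,y}\left[\omega^{f(x+y) - f(x)}\right] \right| = \left| \mathbb{E}_{z}\left[\omega^{f(z)}\right] \right| \cdot \left| \mathbb{E}_{x}\left[\omega^{-f(x)}\right] \right| = \delta \cdot \delta = \delta^2 .$$

Therefore, by the fact that the bias of any function is at most 1, it follows that

$$\Pr_{y \in \mathbb{F}^n}\left[\mathrm{bias}(\Delta_y(f)) \geq \tfrac{1}{2}\delta^2\right] \geq \tfrac{1}{2}\delta^2 .$$

□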
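Both conclusions of Lemma 3.1 (the average bias of a derivative is at least $\delta^2$, and at least a $\frac{1}{2}\delta^2$ fraction of directions give bias at least $\frac{1}{2}\delta^2$) can be observed on a small example. The following sketch is illustrative only; the names `bias` and `derivative` and the choice $f = x_0 x_1 x_2$ over $\mathbb{F}_2^3$ are assumptions.

```python
import cmath
from itertools import product

def bias(f, p, n):
    omega = cmath.exp(2j * cmath.pi / p)
    return abs(sum(omega ** (f(x) % p) for x in product(range(p), repeat=n))) / p ** n

def derivative(f, y, p):
    return lambda x: (f(tuple((a + b) % p for a, b in zip(x, y))) - f(x)) % p

p, n = 2, 3
f = lambda x: x[0] * x[1] * x[2]          # hypothetical biased cubic, bias 3/4
points = list(product(range(p), repeat=n))

delta = bias(f, p, n)
biases = [bias(derivative(f, y, p), p, n) for y in points]
mean_bias = sum(biases) / len(biases)
good_fraction = sum(b >= delta ** 2 / 2 for b in biases) / len(biases)

print(mean_bias >= delta ** 2 - 1e-12)    # True: average bias is at least delta^2
print(good_fraction >= delta ** 2 / 2)    # True: enough directions are biased
```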



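A similar result holds when $f$ has a high $U^d$ norm.

Lemma 3.2. Let $f : \mathbb{F}^n \to \mathbb{F}_p$ be such that $\|f\|_{U^d} = \delta$. Then a fraction of at least $\frac{1}{2}\delta^{2^d}$ of the partial derivatives $\Delta_y(f)$ satisfy $\|\Delta_y(f)\|_{U^{d-1}} \geq \frac{1}{2}\delta^2$.

Proof. The proof is again immediate from the definition.

$$\delta^{2^d} = \|f\|_{U^d}^{2^d} \leq \mathbb{E}_{y}\left[ \|\Delta_y(f)\|_{U^{d-1}}^{2^{d-1}} \right] .$$

As before we get that

$$\Pr_{y}\left[ \|\Delta_y(f)\|_{U^{d-1}}^{2^{d-1}} \geq \tfrac{1}{2}\delta^{2^d} \right] \geq \tfrac{1}{2}\delta^{2^d} .$$

□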



We thus see that in both cases a constant fraction of all partial derivatives of $f$ have high bias or high $U^2$ norm. From Theorem 1.1 we get that if a partial derivative (which is a quadratic function) has a high bias then it depends on a few linear functions.

Lemma 3.3. Let $q$ be a quadratic polynomial over a prime field $\mathbb{F}$. Then $q$ is a function of at most $2\log_{|\mathbb{F}|}(1/\mathrm{bias}(q)) + 1$ linear functions. More accurately, in the notations of Theorem 1.1 the number of non zero $\alpha_i$-s is at most $\log_{|\mathbb{F}|}(1/\mathrm{bias}(q))$.

Proof. See e.g. Lemmas 15-17 of [BV07].

□
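For intuition, the relation between bias and the number of non zero coefficients can be seen directly on the characteristic two canonical form with no linear part. The sketch below is a minimal illustration over $\mathbb{F}_2$ (the names `bias_f2` and `canonical` are assumptions): the polynomial $\sum_{i=1}^{r} x_{2i-1} x_{2i}$ has bias exactly $2^{-r}$, so $\log_2(1/\mathrm{bias})$ recovers $r$.

```python
from itertools import product

def bias_f2(q, n):
    """Bias of q: F_2^n -> F_2, computed as |E_x[(-1)^{q(x)}]|."""
    return abs(sum((-1) ** (q(x) % 2) for x in product((0, 1), repeat=n))) / 2 ** n

def canonical(r):
    """Canonical form x_1 x_2 + x_3 x_4 + ... with r products (0-indexed below)."""
    return lambda x: sum(x[2 * i] * x[2 * i + 1] for i in range(r))

for r in (1, 2, 3):
    print(r, bias_f2(canonical(r), 2 * r))
# 1 0.5
# 2 0.25
# 3 0.125
```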



The next lemma of Bogdanov and Viola [BV07] shows that a similar result holds when a partial derivative has a high $U^2$ norm.

Lemma 3.4. (Lemma 15 of [BV07]) Every quadratic polynomial $q$ over a prime field $\mathbb{F}$ is a function of at most $\log_{|\mathbb{F}|}(1/\|q\|_{U^2}) + 1$ linear functions. Further, in the notations of Theorem 1.1 the number of non zero $\alpha_i$-s is at most $\log_{|\mathbb{F}|}(1/\|q\|_{U^2})$.

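Concluding, we have proved the following lemma (recall Definition 2.6).

Lemma 3.5. Let $f$ be a cubic polynomial.

1. If $\mathrm{bias}(f) = \delta$, then for (at least) a $\frac{\delta^2}{2}$ fraction of $y \in \mathbb{F}^n$ it holds that $\mathrm{rank}_2(\Delta_y(f)) \leq \log_{|\mathbb{F}|}(\frac{2}{\delta^2})$.

2. If $\|f\|_{U^3} = \delta$, then for (at least) a $\frac{\delta^8}{2}$ fraction of $y \in \mathbb{F}^n$ it holds that $\mathrm{rank}_2(\Delta_y(f)) \leq \log_{|\mathbb{F}|}(\frac{2}{\delta^2})$.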

We now combine Lemma 2.8 with Lemma 3.5 and Lemma 2.5 and obtain the following corollary.

Corollary 3.6. Let $f$ be a cubic polynomial. If $\mathrm{bias}(f) = \delta$ or $\|f\|_{U^3} = \delta$ then there exists a subspace $V \subseteq \mathbb{F}^n$ such that $\dim(V) \geq n - O(\log(\frac{1}{\delta}))$ and such that for every $y \in V$ it holds that $\mathrm{rank}_2(\Delta_y(f)) = O(\log^2(\frac{1}{\delta}))$.

3.2 The structure of low rank spaces 

So far we have established the existence of a subspace $V \subseteq \mathbb{F}^n$ such that for every $y \in V$ it holds that $\mathrm{rank}_2(\Delta_y(f)) = O(\log^2(\frac{1}{\delta}))$. We now show that such spaces of low rank polynomials have a very restricted structure. Namely, there exist $r = O(\log^2(\frac{1}{\delta}))$ linear functions $\ell_1,\dots,\ell_r$ such that every $\Delta_y(f)$ can be written as $\Delta_y(f) = \sum_{i=1}^{r} \ell_i \cdot \ell_i^{(y)} + \ell_0^{(y)}$, where the $\ell_i^{(y)}$-s are linear functions determined by $y$. The intuition behind this result is that $\mathrm{rank}_2(q + q')$ can be much smaller than $\mathrm{rank}_2(q) + \mathrm{rank}_2(q')$ only if there is some basis with respect to which $q$ and $q'$ share many linear functions when represented in the form of Theorem 1.1. From this observation we deduce that if we consider some function of maximal rank, $q = \sum_{i=1}^{r} \ell_i \cdot \ell'_i$, and set $\{\ell_i, \ell'_i\}_{i=1}^{r}$ to zero (namely, consider the subspace on which they all vanish), then on this subspace the rank of the remaining quadratic functions decreases by a factor of two. Repeating this argument we get that after setting at most $4r$ linear functions to zero, all our quadratic functions become linear functions.

Lemma 3.7. Let $M$ be a linear space of quadratic functions satisfying $\mathrm{rank}_2(p) \le r$ for all $p \in M$. Then there exists a subspace $V \subseteq \mathbb{F}^n$ of co-dimension $\le 4r$ such that $p|_V$ is a linear function for all $p \in M$.

We shall give the proof for the case $\mathbb{F} = \mathbb{F}_2$. The proof for odd characteristics is very similar (except that in the odd characteristic case we have that the co-dimension of $V$ is $2r$ whereas in the even characteristic case it is $4r$).

Proof. Let $g \in M$ be such that $\mathrm{rank}_2(g) = r$. By Theorem 1.1 $g$ can be expressed as $g = \sum_{i=1}^{r} \ell_{2i-1} \cdot \ell_{2i} + \ell_0$. Denote $V = \{x \mid \ell_1(x) = \ell_2(x) = \dots = \ell_{2r}(x) = 0\}$. We now show that for every $h \in M$ it holds that $\mathrm{rank}_2(h|_V) \le \frac{r}{2}$. Repeating this argument we get that after setting at most $2r + 2(r/2) + 2(r/4) + \dots \le 4r$ linear functions to zero, the rank of all the quadratic functions in $M$ became zero.

Pick some $h \in M$ and denote $\mathrm{rank}_2(h|_V) = s$. As before, $h|_V$ can be expressed as $h|_V = \sum_{i=1}^{s} m_{2i-1} \cdot m_{2i} + m_0$ (where the $m_i$-s are linear functions). Clearly the functions $\{\ell_1,\dots,\ell_{2r}, m_1,\dots,m_{2s}\}$ are linearly independent. We can therefore write

$$h = \sum_{i=1}^{s} m_{2i-1} \cdot m_{2i} + m_0 + \sum_{i=1}^{2r} \ell_i \cdot L_i ,$$

where the $L_i$-s are linear functions. Write $L_i = \tilde{m}_i + \tilde{\ell}_i + L'_i$ where $\tilde{m}_i \in \mathrm{span}\{m_0,\dots,m_{2s}\}$, $\tilde{\ell}_i \in \mathrm{span}\{\ell_0,\dots,\ell_{2r}\}$ and $L'_i$ is linearly independent of the $m_i$-s and $\ell_i$-s. Rearranging terms we get that

$$h = \sum_{i=1}^{s} (m_{2i-1} + \ell'_{2i-1}) \cdot (m_{2i} + \ell'_{2i}) + (m_0 + \ell'_0) + h'(\ell_0,\dots,\ell_{2r}, L'_1,\dots,L'_{2r}) ,$$

where each $\ell'_i$ is in the span of the $\ell_i$-s and $h'$ is a quadratic polynomial. Denote $m'_i = m_i + \ell'_i$. It is clear that $\ell_0,\dots,\ell_{2r}, L'_1,\dots,L'_{2r}$ are linearly independent of the $m'_i$-s (and vice versa). Consequently,$^{4}$ $\mathrm{rank}_2(\sum_{i=1}^{s} m'_{2i-1} \cdot m'_{2i} + m'_0) + \mathrm{rank}_2(h') = \mathrm{rank}_2(h) \le r$. Hence, $\mathrm{rank}_2(h') \le r - s$. We now get that

$$r \ge \mathrm{rank}_2(g + h) = \mathrm{rank}_2\left(\sum_{i=1}^{s} m'_{2i-1} \cdot m'_{2i} + m'_0 + h'(\ell_0,\dots,\ell_{2r}, L'_1,\dots,L'_{2r}) + g\right)$$

$$= \mathrm{rank}_2\left(\sum_{i=1}^{s} m'_{2i-1} \cdot m'_{2i} + m'_0\right) + \mathrm{rank}_2\left(g + h'(\ell_0,\dots,\ell_{2r}, L'_1,\dots,L'_{2r})\right) \ge s + (r - (r - s)) = 2s ,$$

where we used the fact that $\mathrm{rank}_2(g + h') \ge \mathrm{rank}_2(g) - \mathrm{rank}_2(h')$. As we showed that $r \ge 2s$ the proof is completed. □

3.3 Completing the proofs 

We are now ready to complete the proofs of Theorems 1 and 2.

Proof of Theorem 2. By Corollary 3.6 we get that if $\|f\|_{U^3} = \delta$, then there exists a subspace $V \subseteq \mathbb{F}^n$ such that $\dim(V) \ge n - O(\log(1/\delta))$ and such that for every $y \in V$ it holds that $\mathrm{rank}_2(\Delta_y(f)) = O(\log^2(\frac{1}{\delta}))$. Lemma 3.7 implies that there are at most $r = O(\log^2(\frac{1}{\delta}))$ linear functions $\ell_1,\dots,\ell_r$ such that for every $y \in V$ we have that $\Delta_y(f) = \sum_{i=1}^{r} \ell_i \cdot \ell_i^{(y)} + \ell_0^{(y)}$. Let $U = \{x \in V \mid \ell_1(x) = \dots = \ell_r(x) = 0\}$. Then $U$ is a linear space of dimension $\dim(U) \ge n - O(\log^2(\frac{1}{\delta}))$. For every $y \in U$ we have that $\Delta_y(f)|_U = \ell_0^{(y)}|_U$. Hence, for every $y \in U$, $\deg(\Delta_y(f)|_U) \le 1$. Therefore, $\deg(f|_U) \le 2$. Let $\ell'_1,\dots,\ell'_t$ be linearly independent linear functions such that $x \in U$ iff $\ell'_1(x) = \dots = \ell'_t(x) = 0$. It follows that we can write $f = \sum_{i=1}^{t} \ell'_i \cdot q_i + q_0$ for some quadratic polynomials $\{q_i\}$. As $t = n - \dim(U) = O(\log^2(\frac{1}{\delta}))$ the result follows. □

4 From Theorem 1.1 it is clear that for quadratic polynomials $q_1, q_2$ it holds that $\mathrm{rank}_2(q_1(x) + q_2(y)) = \mathrm{rank}_2(q_1(x)) + \mathrm{rank}_2(q_2(y))$.

The proof of Theorem 1 is essentially the same except that we make another small optimization that reduces the required number of quadratic functions.

Proof of Theorem 1. By the same argument as above we get that $f = \sum_{i=1}^{t} \ell_i \cdot q_i + q_0$ for some quadratic polynomials $\{q_i\}$ and linear functions $\{\ell_i\}$, where $t = O(\log^2(\frac{1}{\delta}))$. For convenience we shall assume w.l.o.g. that

$$f = \sum_{i=1}^{t} x_i \cdot q_i + q_0 . \qquad (1)$$

The following lemma shows that by adding a few more linear functions we can assume that no nontrivial linear combination of the $q_i$-s has a low rank.

Lemma 3.8. Let $q_1,\dots,q_t$ be quadratic polynomials over $\mathbb{F}^n$. Then, for every $r$ there exist a subspace $V \subseteq \mathbb{F}^n$ of dimension $\dim(V) \ge n - t(r+1)$, and $t' \le t$ indices $i_1,\dots,i_{t'}$ such that for every affine shift $V'$ of $V$ the following holds

1. For all $i$, $q_i|_{V'} \in \mathrm{span}\{1, q_{i_1}|_{V'},\dots,q_{i_{t'}}|_{V'}\}$.

2. For any non trivial linear combination we have that $\mathrm{rank}_2(\sum_{j=1}^{t'} \alpha_j q_{i_j}|_{V'}) > r$.

Proof. The proof is by induction on $t$. For $t = 1$ the claim is clear: If $\mathrm{rank}_2(q_1) > r$ then we are done. Otherwise we have $q_1 = \sum_{i=1}^{r} \ell_{2i-1}\ell_{2i} + \ell_0$. Letting $V = \{x \mid \ell_0(x) = \ell_2(x) = \dots = \ell_{2r}(x) = 0\}$ the claim follows (indeed notice that passing to an affine shift of $V$ simply means fixing the $\ell_i$-s to arbitrary values). Assume now that we have $q_1,\dots,q_t$ and that (w.l.o.g.) $\mathrm{rank}_2(q_t + \sum_{i=1}^{t-1} \alpha_i q_i) \le r$. Write $q_t + \sum_{i=1}^{t-1} \alpha_i q_i = \sum_{i=1}^{r} \ell_{2i-1}\ell_{2i} + \ell_0$. Set $V = \{x \mid \ell_0(x) = \ell_2(x) = \dots = \ell_{2r}(x) = 0\}$. Then, $q_t|_V \in \mathrm{span}\{q_1|_V,\dots,q_{t-1}|_V\}$. As $\dim(V) = n - (r+1)$ the claim follows by applying the induction argument to $q_1|_V,\dots,q_{t-1}|_V$ (again the claim about any affine shift follows easily). □

We continue with the proof of the theorem. Having Equation (1) in mind we set $U = \{(0,\dots,0,x_{t+1},\dots,x_n)\} \subseteq \mathbb{F}^n$. Applying Lemma 3.8 on $q_1|_U,\dots,q_t|_U$ with $r = \log_{|\mathbb{F}|}(2/\delta)$ we get that there is a subspace $W \subseteq U$ and $t' \le t$ such that: $\dim(W) \ge \dim(U) - (r+1)t \ge n - (r+2)t = n - O(\log^3(\frac{1}{\delta}))$; w.l.o.g. for every $i = 1 \dots t$, $q_i|_W \in \mathrm{span}\{q_1|_W,\dots,q_{t'}|_W\}$; any nontrivial linear combination of $q_1|_W,\dots,q_{t'}|_W$ has rank larger than $r$. By applying an invertible linear transformation$^{5}$ we can further assume that $W = \{x \in \mathbb{F}^n \mid x_1 = \dots = x_m = 0\}$ for some $m \le (r+2)t$. For $i = 1 \dots t'$ let $q'_i = q_i|_W$. Note that $q'_i$ does not contain any of the variables $x_1,\dots,x_m$. We can rewrite Equation (1) as$^{6}$

$$f = \sum_{i=1}^{t'} \ell'_i q'_i + \sum_{i=1}^{t} \sum_{j=1}^{m} x_i x_j \ell_{i,j} \qquad (2)$$
where the $\ell'_i$-s are linearly independent linear functions in $x_1,\dots,x_t$ (and the $\ell_{i,j}$-s are linear functions). We now show that $t' \le \log_{|\mathbb{F}|}(2/\delta)$. Assume for contradiction that $t' > \log_{|\mathbb{F}|}(2/\delta)$. As

$$\mathrm{bias}(f) \le \mathbb{E}_{\alpha_1,\dots,\alpha_{t'}}\left[\mathrm{bias}\left(f(x_1,\dots,x_n)|_{(\ell'_1,\dots,\ell'_{t'}) = (\alpha_1,\dots,\alpha_{t'})}\right)\right] ,$$

there exists an assignment $(x_1,\dots,x_m) = (\beta_1,\dots,\beta_m)$ satisfying $(\ell'_1,\dots,\ell'_{t'}) = (\alpha_1,\dots,\alpha_{t'}) \ne 0$ such that

$$\mathrm{bias}\left(\sum_{i=1}^{t'} \alpha_i q'_i + \sum_{i=1}^{t} \beta_i \sum_{j=1}^{m} \beta_j \ell_{i,j}\right) \ge \delta - |\mathbb{F}|^{-t'} \ge \delta/2 .$$

Therefore, for some constants $\alpha_1,\dots,\alpha_{t'}$ (where not all $\alpha_1,\dots,\alpha_{t'}$ are zero) we have that

$$\mathrm{bias}\left(\sum_{i=1}^{t'} \alpha_i q'_i + \ell\right) \ge \delta/2$$

for some linear function $\ell$. By Lemma 3.3 we get that

$$\mathrm{rank}_2\left(\sum_{j=1}^{t'} \alpha_j q'_j\right) = \mathrm{rank}_2\left(\sum_{j=1}^{t'} \alpha_j q'_j + \ell\right) \le \log_{|\mathbb{F}|}(1/(\delta/2))$$

in contradiction to the choice of $q'_1,\dots,q'_{t'}$.

To complete the proof we explain the reason for dropping $q_0$. Indeed, consider Equation (1). Let $U = \{x \mid x_1 = \dots = x_t = 0\}$. Set $q''_i = q_i|_U$. Then we can rewrite (1) as $\sum_{i=1}^{t} x_i q''_i + q_0 + \sum_{i=1}^{t} \sum_{j=1}^{t} x_i x_j \ell_{i,j}$, for some linear functions $\ell_{i,j}$. Now, for some $\alpha_1,\dots,\alpha_t$ we get that $\mathrm{bias}(\sum_{i=1}^{t} \alpha_i q''_i + q_0 + \sum_{i=1}^{t} \alpha_i \sum_{j=1}^{t} \alpha_j \ell_{i,j}) \ge \delta$. Lemma 3.3 implies that $\mathrm{rank}_2(\sum_{i=1}^{t} \alpha_i q''_i + q_0) \le \log_{|\mathbb{F}|}(1/\delta)$ and so we can replace $q_0$ by a linear combination of the other $q''_i$-s and a function depending on a few linear functions. By passing to a (possibly affine) subspace of dimension at least $n - \log_{|\mathbb{F}|}(1/\delta) - 1$ we get a representation for $f$ without $q_0$. This operation increases $t'$ in Equation (2) by no more than $\log_{|\mathbb{F}|}(1/\delta) + 1$ and so we are done. □

5 This step is not really required but we continue using it just to make the proofs easier to read.
6 We will later explain why $q_0$ 'disappeared' from this expression.



4 The structure of biased 4 degree polynomials 

In this section we prove Theorem 3 on the structure of biased degree 4 polynomials. As in the case of cubic polynomials, we shall focus our attention on a subspace on which all the derivatives have a small rank (a cubic polynomial is of low rank if it depends on a small number of linear and quadratic functions). By a lemma of Bogdanov and Viola [BV07] (Lemma 4.3) we get that $f$ can be well approximated by a function of a small number of its derivatives (which in our case are all of low rank). Thus, $f$ is well approximated by a function of a few linear and quadratic polynomials. By passing to a subspace we can assume that $f$ is well approximated by a function of a small number of quadratic polynomials.
Lemma 1222 implies that (possibly on a slightly smaller subspace) $f$ can be well approximated by a function of a small number of quadratics such that every nontrivial linear combination of them has a high rank. We then show that in this case those quadratic functions are in fact strongly regular (a notion that we later explain) and therefore, by a theorem of Kaufman and Lovett [KL08], $f$ in fact equals a function of those quadratics (on the subspace). We then finish the proof by showing that in this case $f$ also has a nice structure.

4.1 Restricting the polynomial to a 'good' subspace 

In this section we prove an analogous result to Corollary 3.6. We first define the rank of a cubic polynomial.

Definition 4.1. Let $g$ be a degree three polynomial. We define $\mathrm{rank}_3(g)$ to be the minimal integer $r$ for which there are $r$ linear functions $\ell_1,\dots,\ell_r$ and $r+1$ quadratic functions $q_0,\dots,q_r$ such that $g = \sum_{i=1}^{r} \ell_i q_i + q_0$.

Lemma 4.2. Let $f$ be a degree four polynomial satisfying $\mathrm{bias}(f) = \delta$. Then there exist a linear subspace $V \subseteq \mathbb{F}^n$ of dimension $\dim(V) \ge n - O(\log_{|\mathbb{F}|}(1/\delta))$, such that for every $y \in V$

$$\mathrm{rank}_3(\Delta_y(f)) = \log^{O(1)}(1/\delta) .$$

Proof. As before, define $\mathcal{F}(y) = \mathrm{rank}_3(\Delta_y(f))$. It is again not difficult to see that $\mathcal{F}$ is a subadditive function. By Lemma 3.1 we get that there is a subset $S \subseteq \mathbb{F}^n$ of size $\frac{\delta^2}{2} \cdot |\mathbb{F}|^n$ such that for all $y \in S$, $\mathrm{bias}(\Delta_y(f)) \ge \frac{\delta^2}{2}$. Theorem 1 implies that for every $y \in S$ it holds that $\mathrm{rank}_3(\Delta_y(f)) = O(\log^4(\frac{1}{\delta}))$. From Lemma 1231 it follows that there is a linear subspace $V \subseteq \mathbb{F}^n$ with $\dim(V) \ge n - O(\log_{|\mathbb{F}|}(1/\delta))$, such that for every $y \in V$, $\mathrm{rank}_3(\Delta_y(f)) = O(\log^5(\frac{1}{\delta}))$. □

By applying an invertible linear transformation we can assume that $V = \{x : x_1 = \dots = x_m = 0\}$ for some $m = O(\log_{|\mathbb{F}|}(1/\delta))$. We now have

$$f = \sum_{i=1}^{m} x_i g_i + f' , \qquad (3)$$

where $f' = f'(x_{m+1},\dots,x_n)$. Moreover, by Lemma 4.2 it follows that for every $y = (0,\dots,0,y_{m+1},\dots,y_n)$, $\mathrm{rank}_3(\Delta_y(f)) = O(\log^5(\frac{1}{\delta}))$. Notice that for every such $y$ it holds that

$$\Delta_y(f) = \sum_{i=1}^{m} x_i \Delta_y(g_i) + \Delta_y(f') .$$

Hence, $\mathrm{rank}_3(\Delta_y(f')) \le \mathrm{rank}_3(\Delta_y(f)) + m$. We now fix some value to $x_1,\dots,x_m$, such that $\mathrm{bias}(f(\alpha_1,\dots,\alpha_m,x_{m+1},\dots,x_n)) \ge \delta$. Let

$$\tilde{f}(x_{m+1},\dots,x_n) \triangleq f(\alpha_1,\dots,\alpha_m,x_{m+1},\dots,x_n) . \qquad (4)$$

It follows that $\mathrm{bias}(\tilde{f}) \ge \delta$ and that for every $y = (y_{m+1},\dots,y_n)$, $\mathrm{rank}_3(\Delta_y(\tilde{f})) = \mathrm{rank}_3(\Delta_y(f')) = O(\log^5(\frac{1}{\delta}))$ (note that $\deg(\Delta_y(\tilde{f}) - \Delta_y(f')) \le 2$ so they have the same

rank). From now on we will only consider $\tilde{f}$ and not $f$. Observe that if we prove Theorem 3 for $\tilde{f}$ then by considering Equations (3) and (4) we get the required result for $f$ itself.

4.2 Computing $\tilde{f}$ using a few quadratics

We now show that there is a large subspace on which $\tilde{f}$ can be approximated by a function of a few quadratic polynomials. The following lemma of Bogdanov and Viola shows that if $f$ is biased then it can be well approximated by a small set of partial derivatives.

Lemma 4.3. (Lemma 24 from [BV07]) Let $f : \mathbb{F}^n \to \mathbb{F}$ be a function over a finite field $\mathbb{F}$ with $\mathrm{bias}(f) = \delta$. Then there are $t$ directions $a_1,\dots,a_t$ and a function $H$ such that $H(\Delta_{a_1}(f),\dots,\Delta_{a_t}(f))$ $\epsilon$-approximates $f$, where $t \le (1 + \log\frac{1}{\epsilon})(|\mathbb{F}|/\delta)^{O(1)}$.

By the construction of $\tilde{f}$ we know that each of its partial derivatives is of rank $\log^{O(1)}(1/\delta)$ and that $\mathrm{bias}(\tilde{f}) \ge \delta$. Thus, Lemma 4.3 guarantees that $\tilde{f}$ can be well approximated using a few quadratics.

Corollary 4.4. For every $\epsilon > 0$ there are $c = (1 + \log\frac{1}{\epsilon})(|\mathbb{F}|/\delta)^{O(1)}$ quadratic polynomials $Q_1,\dots,Q_c$ and a function $H$ such that $\tilde{f}$ is $\epsilon$-approximated by $H(Q_1,\dots,Q_c)$.

The next lemma, which is the main lemma of [KL08], shows that if the approximation is good enough (i.e. $\epsilon$ is small), and if the quadratics satisfy the strong regularity property then $\tilde{f}$ can in fact be computed by a small number of quadratics.

Definition 4.5. (strongly regular quadratic functions) We say that a family of quadratic functions $\{Q_i\}_{i=1}^{m}$ is $\gamma$-strongly regular if the following holds for every $x_0 \in \mathbb{F}^n$: for independent uniform random variables $Y_1,\dots,Y_5$ the joint distribution of

$$\left\{ Q_j\left(x_0 + \sum_{i \in I} Y_i\right) \;\middle|\; j \in [m],\ I \subseteq [5],\ 1 \le |I| \le 2 \right\}$$

is $\gamma$ close to the uniform distribution (recall Definition 2.1).

This definition is a restricted version of Definition 8 of [KL08] for quadratic polynomials. The interested reader is referred to that paper for the general definition for higher degree polynomials.

Lemma 4.6. (Lemma 13 from [KL08]) Let $f(x)$ be a degree $d$ polynomial, $h_1,\dots,h_m$ polynomials of degree less than $d$ and $H : \mathbb{F}^m \to \mathbb{F}$ a function such that

• $H(h_1,\dots,h_m)$ $\epsilon$-approximates $f$ where $\epsilon \le 2^{-2(d+1)}$.

• $\{h_i\}_{i=1}^{m}$ is a $\gamma$-strongly regular family where $\gamma \le \min\{2^{-d}, 2^{-m}\}$.

Then there exists a function $F : \mathbb{F}^m \to \mathbb{F}$ such that $f = F(h_1,\dots,h_m)$.

In other words, the lemma says that if $f$ is well approximated by a family of strongly regular functions then it can actually be computed everywhere by the functions in the family. We shall now show that if $q_1,\dots,q_c$ are quadratic polynomials such that the rank of every nontrivial linear combination of them is high, then they are strongly regular. This will imply (by Corollary 4.4) that $\tilde{f}$ is a function of a few quadratics and therefore so is $f$.

Lemma 4.7. Let $\{Q_i\}_{i=1}^{m}$ be a family of quadratic functions such that for every nontrivial linear combination, $\mathrm{rank}_2(\sum_{i=1}^{m} \alpha_i Q_i) \ge R$. Then $\{Q_i\}_{i=1}^{m}$ is a $\gamma$-strongly regular family for $\gamma = |\mathbb{F}|^{3m/2 - R/4}$.

Proof. The proof is based on the analogy between quadratic functions and matrices.

Definition 4.8. Let $Q : \mathbb{F}^n \to \mathbb{F}$ be a quadratic polynomial and $A \in \mathbb{F}^{n \times n}$ an $n \times n$ matrix. We say that $A$ represents $Q$ if there exists a linear function $\ell$ such that $Q(x) = x^t A x + \ell(x)$.

Notice that there may be many different matrices representing the same polynomial $Q$. For example, every antisymmetric matrix represents the zero function. More generally, if $S$ is antisymmetric then $A$ and $A + S$ represent the same polynomial.

Lemma 4.9. Let $q$ be a quadratic polynomial. Then $\mathrm{rank}_2(q)$ (recall Definition 2.6) is equal to the minimal rank of a matrix representing $q$. Moreover, for every matrix $A$ representing $q$ we have that $\mathrm{rank}(A + A^t)/2 \le \mathrm{rank}_2(q) \le \mathrm{rank}(A + A^t)$.

We shall prove the lemma for $\mathbb{F} = \mathbb{F}_2$. The proof for other fields is similar.

Proof. Let $\mathrm{rank}_2(q) = r$. Then $q$ can be expressed as $\sum_{i=1}^{r} (\sum_{j=1}^{n} a_{i,j} x_j)(\sum_{j=1}^{n} b_{i,j} x_j) + \ell(x)$. Set $A = (a_{i,j}), B = (b_{i,j}) \in \mathbb{F}^{r \times n}$. It is clear that $A^t B$ represents $q$ and that $\mathrm{rank}(A^t B) \le r$. On the other hand, if $q$ can be represented by a rank $r$ matrix $A$, then let $\ell_1,\dots,\ell_r$ be a basis for the rows of $A$, when interpreted as linear functions.$^{7}$ Let $A_i$ be the $i$-th row of $A$ and denote $A_i = \sum_{j=1}^{r} \alpha_{i,j} \ell_j$. We have that for some linear function $\ell$,

$$q - \ell = x^t A x = \sum_{i=1}^{n} x_i A_i(x) = \sum_{i=1}^{n} x_i \sum_{j=1}^{r} \alpha_{i,j} \ell_j = \sum_{j=1}^{r} \ell_j \left(\sum_{i=1}^{n} \alpha_{i,j} x_i\right) = \sum_{j=1}^{r} \ell_j \ell'_j ,$$

where $\ell'_1,\dots,\ell'_r$ are linear functions. This implies that $\mathrm{rank}_2(q) \le r$. Thus, $\mathrm{rank}_2(q) = \min\{\mathrm{rank}(A) \mid q(x) = x^t A x + \ell(x)\}$.

To prove the second claim, let $A$ be any matrix representing $q$. We first change the basis of the space so that with respect to the new basis $q$ will have the form of Theorem 1.1. Let $T$ be an invertible matrix representing the change of basis. Clearly, $T^t A T$ represents $q \circ T = \sum_{i=1}^{r} x_{2i-1} x_{2i} + \ell$, where $r = \mathrm{rank}_2(q)$. Thus, the matrix $T^t A T$ can be written as $D + S$ where $D$ is a block diagonal matrix consisting of $r$ nonzero blocks of size $2 \times 2$ and $S$ is a symmetric matrix. We also note that for each $2 \times 2$ diagonal block $C$ of $D$ it holds that $C + C^t \ne 0$. We thus get that

$$\mathrm{rank}(A + A^t) = \mathrm{rank}(T^t (A + A^t) T) = \mathrm{rank}(D + S + D^t + S^t) = \mathrm{rank}(D + D^t) .$$

Now, for every $2 \times 2$ diagonal block $C$ of $D$ we have that $1 \le \mathrm{rank}(C + C^t) \le 2$ and so

$$\mathrm{rank}_2(q) = r \le \mathrm{rank}(D + D^t) \le 2r = 2\,\mathrm{rank}_2(q) .$$

This completes the proof of the Lemma.$^{8}$ □
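Over $\mathbb{F}_2$ the lemma gives a direct way to compute $\mathrm{rank}_2(q)$ from any representing matrix, since there $\mathrm{rank}_2(q) = \mathrm{rank}(A + A^t)/2$ (see the remark following the proof). The following Python sketch illustrates this under that assumption; the helper names and the example polynomials are ours and are purely illustrative.

```python
def rank_f2(rows):
    """Rank over F_2 of a 0/1 matrix given as a list of rows (Gaussian elimination)."""
    rows = [[v % 2 for v in r] for r in rows]
    rank = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((i for i in range(rank, len(rows)) if rows[i][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        for i in range(len(rows)):
            if i != rank and rows[i][col]:
                rows[i] = [(a + b) % 2 for a, b in zip(rows[i], rows[rank])]
        rank += 1
    return rank

def rank2_from_matrix(A):
    """rank_2 of the quadratic x^t A x over F_2, computed as rank(A + A^t) / 2."""
    n = len(A)
    sym = [[(A[i][j] + A[j][i]) % 2 for j in range(n)] for i in range(n)]
    return rank_f2(sym) // 2

# q = x1*x2 + x1*x3 + x3*x4 = x1*(x2 + x3) + x3*x4, written as an upper triangular matrix
A = [
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
r = rank2_from_matrix(A)
```

For odd characteristic the same elimination would have to be carried out over the relevant field, and the factor of two relation no longer holds in general.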



7 I.e. $(a_1,\dots,a_n) \to \sum_{i=1}^{n} a_i \cdot x_i$.

8 From the proof it actually follows that over $\mathbb{F}_2$, $\mathrm{rank}_2(q) = \mathrm{rank}(A + A^t)/2$ but this is not the case for other prime fields.

We continue the proof of Lemma 4.7. Using the above observation we now prove that any nontrivial linear combination $\sum_{k \in [m],\, I \subseteq [5],\, 1 \le |I| \le 2} \alpha_{k,I} Q_k(x + \sum_{i \in I} Y_i)$ has high rank (as a quadratic polynomial in the variables $Y_1 \cup \dots \cup Y_5$).

Fix $x = x_0$ and let $A_k$ be a matrix representing $Q_k$. Notice that the quadratic polynomial $Q_k(x_0 + \sum_{i \in I} Y_i)$ (in the variables $\cup_{i=1}^{5} Y_i$) can be represented by a block matrix $B_{k,I} \in \mathbb{F}^{5n \times 5n}$. Indeed, consider a $5 \times 5$ matrix that has 1 in the $(i,j)$-position iff $i, j \in I$, and zeros otherwise. Now, replace any 1 by the matrix $A_k$ and every 0 by the $n \times n$ zero matrix. It is an easy calculation to see that this matrix represents $Q_k(x_0 + \sum_{i \in I} Y_i)$. We shall abuse notations and for $i, j \in I$ say that $(B_{k,I})_{i,j} = A_k$, and that otherwise $(B_{k,I})_{i,j} = 0$.
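The block construction can be checked by brute force on a small instance. The sketch below assumes $\mathbb{F} = \mathbb{F}_2$ and takes $x_0 = 0$, so that no linear terms appear and the two quadratic forms must agree exactly; the function names are illustrative and indices are zero-based.

```python
import itertools

def block_matrix(A, I, blocks=5):
    """B_{k,I}: the (i, j) block equals A when i and j are both in I, and is zero otherwise."""
    n = len(A)
    size = blocks * n
    B = [[0] * size for _ in range(size)]
    for i in I:
        for j in I:
            for a in range(n):
                for b in range(n):
                    B[i * n + a][j * n + b] = A[a][b]
    return B

def quad_form(M, v):
    """The value v^t M v reduced modulo 2."""
    size = len(v)
    return sum(v[i] * M[i][j] * v[j] for i in range(size) for j in range(size)) % 2

def represents(A, I, blocks=5):
    """Check Y^t B Y == (sum_{i in I} Y_i)^t A (sum_{i in I} Y_i) for every Y, with x_0 = 0."""
    n = len(A)
    B = block_matrix(A, I, blocks)
    for Y in itertools.product((0, 1), repeat=blocks * n):
        s = [sum(Y[i * n + a] for i in I) % 2 for a in range(n)]
        if quad_form(B, Y) != quad_form(A, s):
            return False
    return True

# Illustrative matrix for Q(x) = x1 + x1*x2 + x2 over F_2 (x_i^2 = x_i)
A_example = [[1, 1], [0, 1]]
ok_pair = represents(A_example, (1, 3))
ok_single = represents(A_example, (2,))
```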

Clearly, the linear combination

$$Q' \triangleq \sum \left\{ \alpha_{k,I} Q_k\left(x + \sum_{i \in I} Y_i\right) \;\middle|\; k \in [m],\ I \subseteq [5],\ 1 \le |I| \le 2 \right\}$$

is represented by the matrix

$$C \triangleq \sum \left\{ \alpha_{k,I} B_{k,I} \;\middle|\; k \in [m],\ I \subseteq [5],\ 1 \le |I| \le 2 \right\} .$$

Observe that for $i \ne j \in [5]$, $C_{i,j} = \sum_{k \in [m]} \alpha_{k,\{i,j\}} A_k$. We now show that if for some $i \ne j \in [5]$ and $k \in [m]$ it holds that $\alpha_{k,\{i,j\}} \ne 0$ then the rank of $C^t + C$ (and hence of $Q'$) is high.

$$\mathrm{rank}_2(Q') \ge \tfrac{1}{2}\mathrm{rank}(C + C^t) \ge \tfrac{1}{2}\mathrm{rank}(C_{i,j} + C_{j,i}^t) = \tfrac{1}{2}\mathrm{rank}\left(\sum_{k \in [m]} \alpha_{k,\{i,j\}} (A_k + A_k^t)\right) \ge \tfrac{1}{2}\mathrm{rank}_2\left(\sum_{k \in [m]} \alpha_{k,\{i,j\}} Q_k\right) \ge \tfrac{1}{2} R .$$

If it is not the case, namely, for all $i \ne j \in [5]$, $k \in [m]$, $\alpha_{k,\{i,j\}} = 0$, then there is some $i \in [5]$ and $k \in [m]$ such that $\alpha_{k,\{i\}} \ne 0$ and we get the same result by considering $C_{i,i}$ instead.

To conclude, every nontrivial linear combination of $\{Q_j(x + \sum_{i \in I} Y_i)\}_{j \in [m],\, I \subseteq [5],\, 1 \le |I| \le 2}$ has rank greater than $\frac{1}{2}R$. Lemma 3.3 implies that the bias of every such linear combination is bounded by $|\mathbb{F}|^{-R/4}$. It now follows by Lemma 2.2 that the distribution is $|\mathbb{F}|^{3m/2 - R/4}$ close to the uniform distribution as needed. □

We thus get the following corollary.

Corollary 4.10. Let $g(x)$ be a degree $d$ polynomial, $h_1,\dots,h_m$ quadratic polynomials and $H : \mathbb{F}^m \to \mathbb{F}$ a function such that

• $H(h_1,\dots,h_m)$ $\epsilon$-approximates $g$ where $\epsilon \le 2^{-2(d+1)}$.

• The bias of every non trivial combination of $h_1,\dots,h_m$ is at most $|\mathbb{F}|^{-\Omega(m+d)}$.

Then there exists a function $G : \mathbb{F}^m \to \mathbb{F}$ such that $g = G(h_1,\dots,h_m)$.

We now show that $\tilde{f}$ can be computed by a few quadratics.

Lemma 4.11. Let $g : \mathbb{F}^n \to \mathbb{F}$ be a quartic polynomial such that for every $y$, $\mathrm{rank}_3(\Delta_y(g)) \le \mathrm{poly}(1/\delta)$. Then there exist a subspace $W$, $c = \mathrm{poly}(|\mathbb{F}|/\delta)$ quadratics $q'_1,\dots,q'_c$ and a function $G$ such that $\dim(W) = n - \mathrm{poly}(|\mathbb{F}|/\delta)$ and $g|_W = G(q'_1,\dots,q'_c)$.

Proof. Applying Lemma 4.3 and using the fact that every partial derivative of $g$ has a low rank, we conclude that for $\epsilon = 2^{-20}$ there exist $c = \mathrm{poly}(|\mathbb{F}|/\delta)$ linear and quadratic functions, and a function $H$, such that $H(\ell_1,\dots,\ell_c,q_1,\dots,q_c)$ $\epsilon$-approximates $g$. Let $r = \mathrm{poly}(|\mathbb{F}|/\delta)$ and $U = \{x : \ell_1(x) = \alpha_1,\dots,\ell_c(x) = \alpha_c\}$ be some subspace such that $H(\alpha_1,\dots,\alpha_c,q_1|_U,\dots,q_c|_U)$ $\epsilon$-approximates $g|_U$. Applying Lemma 3.8 on $q_1|_U,\dots,q_c|_U$ and $r$ we get that there exists a (possibly affine) subspace $W \subseteq U$ and $c' \le c$ such that: $\dim(W) \ge \dim(U) - (r+1)c \ge n - (r+2)c = n - \mathrm{poly}(|\mathbb{F}|/\delta)$; w.l.o.g. for every $i = 1 \dots c$, $q_i|_W \in \mathrm{span}\{q_1|_W,\dots,q_{c'}|_W\}$; any nontrivial linear combination of $q_1|_W,\dots,q_{c'}|_W$ has rank larger than $r$; $g|_W$ is $\epsilon$-approximated by $H(\ell_1|_W,\dots,\ell_c|_W,q_1|_W,\dots,q_c|_W)$ (this follows by picking an adequate shift of the linear space in the lemma). Hence, $g|_W$ is $\epsilon$-approximated by $H(\ell_1|_W,\dots,\ell_c|_W,q_1|_W,\dots,q_c|_W) = H'(q_1|_W,\dots,q_{c'}|_W)$ for some $H'$. The reason for passing to $W$ is that now any nontrivial linear combination of $q_1|_W,\dots,q_{c'}|_W$ has rank larger than $r$. We thus get by Corollary 4.10 that there is some function $G$ such that $g|_W = G(q_1|_W,\dots,q_{c'}|_W)$. □

Recall that we assume w.l.o.g. that for every $y \in \mathbb{F}^{n-m}$, $\mathrm{rank}_3(\Delta_y(\tilde{f})) \le \mathrm{poly}(1/\delta)$. Thus, the lemma above implies the following corollary.

Corollary 4.12. In the notations of the proof, there exist a subspace $Z \subseteq \mathbb{F}^{n-m}$ of dimension $\dim(Z) \ge n - \mathrm{poly}(|\mathbb{F}|/\delta)$ such that $\tilde{f}|_Z = F(q_1,\dots,q_c)$, for $c = \mathrm{poly}(|\mathbb{F}|/\delta)$ quadratic polynomials and some function $F$.

4.3 The structure of $f$

We now show that we can represent $f$ as $f = \sum_{i=1}^{k} \ell_i \cdot g_i + \sum_{i=1}^{k} q'_i \cdot q''_i$ where $k = \mathrm{poly}(|\mathbb{F}|/\delta)$, the $\ell_i$-s are linear, the $q'_i$-s and $q''_i$-s are quadratic and the $g_i$-s are cubic polynomials. For this we will transform the quadratic polynomials to be what we denote as disjoint polynomials.

Definition 4.13. We say that the quadratic polynomials $\{Q_i\}_{i=1}^{m}$ are disjoint if there is a linear transformation $T$, $2m$ variables $\{x_i\}_{i=1}^{m} \cup \{y_i\}_{i=1}^{m}$, where possibly for several $i$-s $x_i = y_i$, and quadratic functions $\{Q'_i\}_{i=1}^{m}$ such that for every $k \in [m]$, $Q_k \circ T = x_k \cdot y_k + Q'_k$ where no degree two monomial in $Q'_k$ contains a variable from $\{x_i\}_{i=1}^{m} \cup \{y_i\}_{i=1}^{m}$.

Lemma 4.14. Let $q_1,\dots,q_c$ be quadratic polynomials from $\mathbb{F}^n$ to $\mathbb{F}$. Assume that the rank of every nontrivial linear combination of them is at least $r$. Then there exist a subspace $V \subseteq \mathbb{F}^n$ of dimension $\ge n - 2c^2$ and $c' \le c$ quadratic polynomials $q'_1,\dots,q'_{c'} : V \to \mathbb{F}$ satisfying: the $q'_i$-s are disjoint; every nontrivial linear combination of the $q'_i$-s has rank at least $r - 2c^2$; $\mathrm{span}(q'_1,\dots,q'_{c'}) = \mathrm{span}(q_1|_V,\dots,q_c|_V)$.

Proof. We prove the lemma by iteratively changing each $q_i$ to a 'disjoint' form. We shall give the proof over $\mathbb{F}_2$ but almost the same proof holds for odd characteristics as well. We start with $q_1$. Assume w.l.o.g. that $x_1 \cdot x_2$ appears in $q_1$. Now, from every other $q_i$ subtract an appropriate multiple of $q_1$ such that at the end $x_1 \cdot x_2$ only appears in $q_1$. For simplicity we call the new polynomial $q_i$ as well. Now, for $2 \le i$ and $j \in \{1,2\}$ let $x_j \cdot \ell_{j,i}$ be the degree two monomials involving $x_j$ in $q_i$. For $q_1$ let $x_j \cdot \ell_{j,1}$ be the degree 2 monomials involving $x_j$ in $q_1 - x_1 \cdot x_2$. Let $V_1 = \{x \mid \ell_{1,1}(x) = \dots = \ell_{2,c}(x) = 0\}$. Notice that none of the $\ell_{j,i}$-s contain $x_1$ or $x_2$. After restricting the polynomials to $V_1$ we have that $x_1 \cdot x_2$ appears in $q_1$ and every other appearance of either $x_1$ or $x_2$ is in degree one monomials. We now move to (the 'new') $q_2$ and continue this process. At the end we obtain a subspace $V$ and quadratics $q'_1,\dots,q'_{c'}$ ($c'$ may be smaller than $c$ if some polynomials vanished in the process). As at each step we set at most $2c$ linear functions to zero, for a total of at most $2c^2$ linear functions, the claims about the dimension of $V$ and the rank of every linear combination of the $q'_i$-s follow. It is clear that the $q'_i$-s span the $q_i|_V$-s and so the lemma is proved.

When dealing with odd characteristics instead of looking for $x_1 \cdot x_2$ we search for $x_1^2$. By applying an invertible linear transformation such a monomial always exists and we continue with the same argument. □

The usefulness of the definition is demonstrated in the following lemma. 

Lemma 4.15. Let $q_1,\dots,q_c$ be disjoint quadratic polynomials. Assume that $\deg(f) = 2d$ and $f = F(q_1,\dots,q_c)$ for some function $F(z_1,\dots,z_c)$. Then as a polynomial over $\mathbb{F}$, $\deg(F) \le d$.

Proof. We shall give the proof over $\mathbb{F}_2$ but it is again similar over odd characteristic fields. Let $z_1^{e_1} \cdots z_c^{e_c}$ be a monomial of maximal degree in $F$. When composing it with $q_1,\dots,q_c$ we get that $q_1^{e_1} \cdots q_c^{e_c}$ contains the monomial $\prod_{i=1}^{c} (x_i \cdot y_i)^{e_i}$. As $z_1^{e_1} \cdots z_c^{e_c}$ is of maximal degree and each $x_i$ and $y_i$ appear only as linear terms in all the $q_j$-s (except the monomial $x_i \cdot y_i$ in $q_i$) we see that this monomial cannot be cancelled by any other monomial created in $F(q_1,\dots,q_c)$. Therefore the monomial $\prod_{i=1}^{c} (x_i \cdot y_i)^{e_i}$ belongs to $f$ as well. Since $\deg(f) = 2d$ it must be the case that $2e_1 + \dots + 2e_c \le 2d$. Hence, $\deg(F) = \sum_{i=1}^{c} e_i \le d$. □
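The degree doubling in the lemma can be observed on a toy example. The following Python sketch represents polynomials over $\mathbb{F}_2$ as sets of multilinear monomials (using $x^2 = x$, which is an assumption about how we model functions over $\mathbb{F}_2$), composes a degree two $F$ with two disjoint quadratics and checks that the monomial $x_1 y_1 x_2 y_2$ survives. All names and the specific polynomials are hypothetical.

```python
def poly_add(p, q):
    """Sum over F_2 of two polynomials given as sets of monomials."""
    return p ^ q

def poly_mul(p, q):
    """Product over F_2; a monomial is a frozenset of variable names, so x * x = x."""
    out = set()
    for a in p:
        for b in q:
            out ^= {a | b}
    return out

def degree(p):
    """Total degree, or -1 for the zero polynomial."""
    return max((len(mono) for mono in p), default=-1)

def var(name):
    return {frozenset([name])}

def compose(F, subs):
    """Substitute the polynomial subs[z] for every variable z of F."""
    result = set()
    for mono in F:
        term = {frozenset()}  # the constant polynomial 1
        for z in mono:
            term = poly_mul(term, subs[z])
        result = poly_add(result, term)
    return result

# Disjoint quadratics: x_i * y_i plus terms in which x_i, y_i occur only linearly
q1 = poly_add(poly_mul(var("x1"), var("y1")), var("w"))
q2 = poly_add(poly_add(poly_mul(var("x2"), var("y2")), poly_mul(var("u"), var("v"))), var("x1"))

# F(z1, z2) = z1 * z2 + z1 has degree 2
F = {frozenset(["z1", "z2"]), frozenset(["z1"])}
f_poly = compose(F, {"z1": q1, "z2": q2})
```

In this example `degree(f_poly)` equals twice the degree of `F`, as the lemma predicts.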

We are now ready to complete the proof of Theorem 3.

Proof of Theorem 3. Combining Corollary 4.12, Lemma 4.15 and Lemma 4.14 we get that for the subspace $Z$ of Corollary 4.12 there exist a subspace $Z' \subseteq Z$, of dimension $\dim(Z') \ge \dim(Z) - \mathrm{poly}(|\mathbb{F}|/\delta)$, $b = \mathrm{poly}(|\mathbb{F}|/\delta)$ quadratic polynomials $Q_1,\dots,Q_b$ and a quadratic polynomial $H$ such that $\tilde{f}|_{Z'} = H(Q_1,\dots,Q_b)$. In other words $\tilde{f}|_{Z'} = \sum_{i \le j} \alpha_{i,j} Q_i Q_j + Q_0$.

As $f|_{Z'} = \tilde{f}|_{Z'}$ it follows that $f|_{Z'} = \sum_{i \le j} \alpha_{i,j} Q_i Q_j + Q_0$. Assume w.l.o.g.$^{9}$ that $Z'$ is defined as $Z' = \{x \mid x_1 = \beta_1,\dots,x_k = \beta_k\}$ for some $k = \mathrm{poly}(|\mathbb{F}|/\delta)$. Then it is clear that we can write $f = \sum_{i=1}^{k} x_i \cdot g_i + \sum_{i \le j} \alpha_{i,j} Q_i Q_j + g_0$ for cubic polynomials $g_0,\dots,g_k$.

□

9 This is true up to an invertible linear transformation and an affine shift and has no real effect on the result, but rather simplifies the notations.

5 Quartic polynomials with high $U^4$ norm

In this section we prove Theorems 4 and 5. Intuitively, the notion of $d+1$ Gowers norm indicates how close a given function is to a degree $d$ polynomial. In fact, it was conjectured that if the $U^{d+1}$ norm is bounded away from zero then the function has a noticeable correlation with a degree $d$ polynomial. This conjecture turned out to be false even when the function is a degree four polynomial and $d = 3$ [LMS08, GT07]. Here we will show that for this special case a weaker conclusion holds. Namely, that for any degree four polynomial $f$ there exists a subspace $V$ of dimension $n/\exp(\mathrm{poly}(1/\|f\|_{U^4}))$ on which $f|_V$ is equal to some cubic polynomial. In fact an even stronger conclusion holds: there exists a partition of (a subspace of small co-dimension of) $\mathbb{F}^n$ to such subspaces on which $f$ equals a cubic. To ease the reading we restate Theorem 4 here.

Theorem (Theorem 4). Let $\mathbb{F}$ be a finite field and $f \in \mathbb{F}[x_1,\dots,x_n]$ a degree four polynomial such that $\|f\|_{U^4} = \delta$. Then there exists a partition of a subspace $V \subseteq \mathbb{F}^n$, of dimension $\dim(V) = n - \mathrm{poly}(|\mathbb{F}|/\delta)$, to subspaces $\{V_\alpha\}_{\alpha \in I}$, satisfying $\dim(V_\alpha) = \Omega(n/|\mathbb{F}|^{\mathrm{poly}(1/\delta)})$, such that for every $\alpha \in I$, $f|_{V_\alpha}$ is a cubic polynomial.

In other words, the theorem says that for $r = \mathrm{poly}(1/\delta)$ any such $f$ (possibly after a change of basis of $\mathbb{F}^n$) can be written as $f = \sum_{i=1}^{r} x_{n-r+i}\, g_i(x_1,\dots,x_n) + f'(x_1,\dots,x_{n-r}) + g_0$, where the $g_i$-s are degree three polynomials and $f'$ is a polynomial for which there exists a partition of $\mathbb{F}^{n-r}$ to subspaces $\{V_\alpha\}_{\alpha \in I}$, satisfying $\dim(V_\alpha) = \Omega(n/\exp(\mathrm{poly}(1/\delta)))$, such that for every $\alpha \in I$, $f'|_{V_\alpha}$ is a cubic polynomial.

As in the proof of Theorem 2 we start by passing to a subspace of a constant codimension on which every derivative has low rank, i.e. $\Delta_y(f) = \sum_{i=1}^{r} \ell_i Q_i + Q_0$. Then we shall deduce that there is some common 'basis' $\{\ell_i\}$, $\{Q_i\}$ to all the derivatives. Namely, every derivative $\Delta_y(f)$ can be expressed as $\sum_{i} \ell_i^{(y)} Q_i + \sum_{i} \ell_i Q_i^{(y)} + Q_0^{(y)}$ (where $y$ in the exponent means that the polynomial may depend on $y$). This is the main technical difficulty of the proof and it is based on an extension of Lemma 3.7 to the case of low rank cubic polynomials. Then, we conclude that for every setting $\alpha$ of $\{\ell_i\}$, $\{Q_i\}$ we obtain a subspace $V_\alpha$ on which all the derivatives are quadratic polynomials, i.e. $f|_{V_\alpha}$ is cubic.

5.1 The case of the symmetric polynomial 

Let $S_k(x_1,\dots,x_n) \triangleq \sum_{1 \le i_1 < \dots < i_k \le n} x_{i_1} \cdot x_{i_2} \cdots x_{i_k}$. In [GT07, LMS08] it was shown that over $\mathbb{F}_2$, it holds that $\|S_4\|_{U^4} \ge \delta$, for some absolute constant $0 < \delta$, but for every degree three polynomial $g$, $\Pr[S_4 = g] \le 1/2 + \exp(-n)$. To make the claim of Theorem 4 clearer we shall work out the case of $S_4$ as an example.

Consider a partial derivative $\Delta_y(S_4)$. For simplicity assume that $n = 4m$. Computing
we get that 

n n n 

(5)

In particular, $S_2$ is a 'basis' for the set of partial derivatives of $S_4$. Continuing, we have that

$$S_2(x_1,\dots,x_n) = \sum_{k=1}^{2m} \left(\sum_{i=1}^{2k-1} x_i\right) \cdot \left(x_{2k} + \sum_{i=1}^{2k-2} x_i\right) + \sum_{i=1}^{m} (x_{4i-3} + x_{4i-2}) . \qquad (6)$$

For $k = 1,\dots,2m$ let $\ell_k \triangleq \sum_{i=1}^{2k-1} x_i$. Notice that fixing $\ell_1,\dots,\ell_{2m}$ reduces the degree of $S_2$ to one and so every partial derivative of $S_4$ will have degree two. For example, consider the space $V_0 = \{x \mid \ell_1(x) = \dots = \ell_{2m}(x) = 0\}$. Rewriting we get $V_0 = \{(0,y_1,y_1,y_2,y_2,\dots,y_{2m-1},y_{2m-1},y_{2m})\}$. Computing we get that

$$S_4|_{V_0} = S_2(y_1,\dots,y_{2m-1}) .$$

A closer inspection shows that no matter how we set $\ell_1,\dots,\ell_{2m}$ we will get that the degree of $S_4$ becomes two.
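Both identities of this example can be verified exhaustively for small $n$. The following Python sketch checks Equation (6) and the restriction of $S_4$ to $V_0$ over $\mathbb{F}_2$ for $n = 8$ (that is, $m = 2$); the function names are illustrative, and all arithmetic is taken modulo 2.

```python
import itertools

def elem_sym_mod2(k, x):
    """S_k(x) over F_2: the number of k-subsets of coordinates that are all 1, modulo 2."""
    return sum(all(x[i] for i in idx)
               for idx in itertools.combinations(range(len(x)), k)) % 2

def s2_via_eq6(x):
    """Right-hand side of Equation (6) for n = 4m, evaluated modulo 2."""
    m = len(x) // 4
    total = 0
    for k in range(1, 2 * m + 1):
        left = sum(x[:2 * k - 1])                   # x_1 + ... + x_{2k-1}
        right = x[2 * k - 1] + sum(x[:2 * k - 2])   # x_{2k} + x_1 + ... + x_{2k-2}
        total += left * right
    for i in range(1, m + 1):
        total += x[4 * i - 4] + x[4 * i - 3]        # x_{4i-3} + x_{4i-2}
    return total % 2

def point_of_v0(y):
    """The point (0, y_1, y_1, ..., y_{2m-1}, y_{2m-1}, y_{2m}) of V_0."""
    doubled = [v for yi in y[:-1] for v in (yi, yi)]
    return tuple([0] + doubled + [y[-1]])

n = 8
eq6_holds = all(s2_via_eq6(x) == elem_sym_mod2(2, x)
                for x in itertools.product((0, 1), repeat=n))
restriction_holds = all(
    elem_sym_mod2(4, point_of_v0(y)) == elem_sym_mod2(2, y[:-1])
    for y in itertools.product((0, 1), repeat=n // 2)
)
```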



5.2 Finding a 'basis' for a space of low rank cubic polynomials 

In this section we prove the main technical result showing that a subspace of degree 3 polynomials with low rank has a small 'basis'.

Lemma 5.1 (Main Lemma). Let $M$ be a vector space of cubic polynomials satisfying $\mathrm{rank}_3(f) \le r$ for all $f \in M$. Then there exists a set of linear and quadratic functions $\{Q_i\}_{i=1}^{t_1} \cup \{\ell_i\}_{i=1}^{t_2}$, for $t_1 \le r$ and $t_2 = 2^{O(r)}$, such that every $f \in M$ can be represented as $f = \sum_{i=1}^{t_1} \ell'_i Q_i + \sum_{i=1}^{t_2} \ell_i Q'_i + Q'_0$ for some linear and quadratic functions $\{\ell'_i\}_{i=1}^{t_1} \cup \{Q'_i\}_{i=0}^{t_2}$.

The rest of this section is devoted to proving this lemma. Similarly to the proof of Lemma 3.7 we will work modulo a collection of linear and quadratic polynomials. For this we shall need the following definition.

Definition 5.2. For a cubic polynomial $f$ we say that $\mathrm{rank}_3^c(f) = r$ if $r$ is the minimal integer such that $f$ can be written as

$$f = \sum_{i=1}^{r} \ell_i Q_i + \sum_{i=1}^{c} \ell_i^{(1)} \ell_i^{(2)} \ell_i^{(3)} + Q_0 , \qquad (7)$$

where the $\ell$-s are linear functions and the $Q$-s are quadratics.

To see the difference from the previous notion of $\mathrm{rank}_3$ (Definition 4.1) we observe that if $f$ is a degree three polynomial with $\mathrm{rank}_3(f) = r$ then $f = \sum_{i=1}^{r} \ell_i Q_i + Q_0$. If we also know that some nontrivial linear combination of $Q_1,\dots,Q_r$ has rank (as a quadratic polynomial) less than $c$ then $\mathrm{rank}_3^c(f) < r$. I.e. $\mathrm{rank}_3^c(f)$ ignores, in some sense, low rank quadratic functions in the representation of $f$.

Definition 5.3. Let $A = \{Q_i\}_{i=1}^{t_1} \cup \{\ell_i\}_{i=1}^{t_2}$ be a set of linear and quadratic functions and let
$f : \mathbb{F}^n \to \mathbb{F}$ be a degree three polynomial. Denote

$$[f]_A = \Big\{ f + \sum_{i=1}^{t_1} \ell'_i Q_i + \sum_{i=1}^{t_2} \ell_i Q'_i + Q'_0 \;\Big|\; \text{for linear and quadratic functions } \{\ell'_i\}_{i=1}^{t_1}, \{Q'_i\}_{i=0}^{t_2} \Big\}.$$

For a linear space $M$ of degree three functions, we define the subspace $[M]_A$ to be

$$[M]_A \triangleq \{[f]_A \mid f \in M\}.$$

As before we define $\mathrm{rank}_3^c([f]_A)$ to be the lowest rank of functions in $[f]_A$:

$$\mathrm{rank}_3^c([f]_A) = \min\{\mathrm{rank}_3^c(g) \mid g \in [f]_A\}.$$

The definition of $[f]_A$ resembles, in some sense, the notion of working modulo an ideal.
However, we note that as opposed to the usual definition, where for every $f$, $\{Q'_i\}_{i=1}^{t_2} \cup \{\ell'_i\}_{i=1}^{t_1}$
can be arbitrary functions, in our definition they are restricted to being quadratic and linear
functions, respectively.

We are now ready to prove the main lemma of this section that shows the existence of a 
small 'basis' for any linear space of cubic polynomials of low rank. 

Lemma 5.4. Let $A = \{Q_i\}_{i=1}^{t_1} \cup \{\ell_i\}_{i=1}^{t_2}$ be a set of linear and quadratic polynomials. Let
$M$ be a linear space of cubic polynomials such that for every $[f]_A \in [M]_A$, $\mathrm{rank}_3^c([f]_A) \le r$.
Then, there are $r$ linear functions $\{\ell_i\}_{i=1}^{r}$ and a quadratic polynomial $Q$ such that for
$A' = A \cup \{\ell_i\}_{i=1}^{r} \cup \{Q\}$ it holds that every $[f]_{A'} \in [M]_{A'}$ satisfies $\mathrm{rank}_3^{c'}([f]_{A'}) \le r - 1$, for
$c' = 11c + 3r + t_1$.

In other words, the lemma says that we can find a small set of linear functions and one
quadratic polynomial such that by adding them to $A$ and increasing $c$ by a constant factor,
we can decrease the $\mathrm{rank}_3^c$ of every polynomial in $[M]_{A'}$.

Proof. Assume that there is some $[g]_A \in [M]_A$ such that $\mathrm{rank}_3^{c'}(g) = \mathrm{rank}_3^{c'}([g]_A) = r$. If
no such $g$ exists then for every $[f]_A \in [M]_A$, $\mathrm{rank}_3^{c'}([f]_A) \le r - 1$ and there is nothing
to prove. As $c < c'$ it also holds that $\mathrm{rank}_3^{c}([g]_A) = r$. Hence, $g$ can be represented as
$\sum_{i=1}^{r} \ell_i^g Q_i^g + \sum_{i=1}^{c} \ell_i^{g,(1)} \ell_i^{g,(2)} \ell_i^{g,(3)}$ (see footnote 10). Note that $\mathrm{rank}_2([Q_1^g]_A) > c' - c$ as otherwise we could
replace $Q_1^g$ with a function of the form $\sum_{i=1}^{t_1} \alpha_i Q_i + \sum_{i=1}^{t_2} \ell_i \ell'_i + \sum_{i=1}^{c'-c} m_i m'_i$, where the $m$-s
are linear functions, and get that $\mathrm{rank}_3^{c'}([g]_A) \le r - 1$.

Set $A' = A \cup \{\ell_i^g\}_{i=1}^{r} \cup \{Q_1^g\}$. Assume for contradiction that there is some $h \in M$
satisfying $\mathrm{rank}_3^{c'}([h]_{A'}) = r$. This implies that $\mathrm{rank}_3^{c}([h]_A) = r$ and that $\mathrm{rank}_3^{c'-c}([h+g]_{A'}) = r$
as well. Indeed, if the latter equation was not true then by expressing $h + g$ as a low
$\mathrm{rank}_3^{c'-c}$ polynomial and moving $g$ to the other side we would get that $\mathrm{rank}_3^{c'}([h]_{A'}) < r$,
in contradiction (recall that $\{\ell_i^g\} \subset A'$). From this we get that $\mathrm{rank}_3^{c}([h+g]_A) = r$ as
well. Let $f \in [h+g]_A$ be such that $\mathrm{rank}_3^{c}(f) = r$. Express $h$ and $f$ as
$h = \sum_{i=1}^{r} \ell_i^h Q_i^h + \sum_{i=1}^{c} \ell_i^{h,(1)} \ell_i^{h,(2)} \ell_i^{h,(3)}$ and $f = \sum_{i=1}^{r} \ell_i^f Q_i^f + \sum_{i=1}^{c} \ell_i^{f,(1)} \ell_i^{f,(2)} \ell_i^{f,(3)}$. Note that we can assume
w.l.o.g. that $\{\ell_i\}_{i=1}^{t_2}$ are linearly independent as otherwise we can just replace them with a
linearly independent subset. Similarly, we can also assume that $\{\ell_i\}_{i=1}^{t_2} \cup \{\ell_i^g\}_{i=1}^{r}$ are linearly
independent as otherwise we can find a representation for a function in $[g]_A$ with a smaller
rank. Using the same argument again we conclude that $\{\ell_i\}_{i=1}^{t_2} \cup \{\ell_i^g\}_{i=1}^{r} \cup \{\ell_i^h\}_{i=1}^{r}$ are linearly
independent as well (by considering $[h]_{A'}$).

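10 By definition of $[g]_A$ we can add any quadratic polynomial to $g$, so we can assume that there is no extra
$Q_0$ term in the representation of $g$.

Since $g + h - f \in [0]_A$, we can express this polynomial as $g + h - f = \sum_{i=1}^{t_1} \ell'_i Q_i + \sum_{i=1}^{t_2} \ell_i Q'_i + Q'_0$.
In other words:

$$\sum_{i=1}^{r} \ell_i^g Q_i^g + \sum_{i=1}^{c} \ell_i^{g,(1)} \ell_i^{g,(2)} \ell_i^{g,(3)} + \sum_{i=1}^{r} \ell_i^h Q_i^h + \sum_{i=1}^{c} \ell_i^{h,(1)} \ell_i^{h,(2)} \ell_i^{h,(3)} = \sum_{i=1}^{r} \ell_i^f Q_i^f + \sum_{i=1}^{c} \ell_i^{f,(1)} \ell_i^{f,(2)} \ell_i^{f,(3)} + \sum_{i=1}^{t_1} \ell'_i Q_i + \sum_{i=1}^{t_2} \ell_i Q'_i + Q'_0. \qquad (8)$$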



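To ease notations, using the fact that $\{\ell_i\}_{i=1}^{t_2} \cup \{\ell_i^g\}_{i=1}^{r} \cup \{\ell_i^h\}_{i=1}^{r}$ are linearly independent, let
us assume w.l.o.g. that $\forall i$, $\ell_i^g = x_i$, $\ell_i^h = x_{r+i}$ and $\ell_i = x_{2r+i}$. Thus, Equation (8) becomes

$$\sum_{i=1}^{r} x_i Q_i^g + \sum_{i=1}^{c} \ell_i^{g,(1)} \ell_i^{g,(2)} \ell_i^{g,(3)} + \sum_{i=1}^{r} x_{r+i} Q_i^h + \sum_{i=1}^{c} \ell_i^{h,(1)} \ell_i^{h,(2)} \ell_i^{h,(3)} - \Big( \sum_{i=1}^{r} \ell_i^f Q_i^f + \sum_{i=1}^{c} \ell_i^{f,(1)} \ell_i^{f,(2)} \ell_i^{f,(3)} + \sum_{i=1}^{t_1} \ell'_i Q_i + \sum_{i=1}^{t_2} x_{2r+i} Q'_i + Q'_0 \Big) = 0, \qquad (9)$$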



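where we remember that variables from $\{x_i\}_{i=1}^{2r+t_2}$ may appear in the linear and quadratic
functions in the expression. Consider all terms involving $x_1$ (recall that $\ell_1^g = x_1$) in
Equation (9). Clearly they sum to zero, but, after factoring out $x_1$, they can also be written as

$$0 = Q_1^g + \sum_{i=2}^{r} x_i m_i^g + \sum_{i=1}^{3c} \alpha_i^g m_i^{g,(1)} m_i^{g,(2)} + \sum_{i=1}^{r} x_{r+i} m_i^h + \sum_{i=1}^{3c} \alpha_i^h m_i^{h,(1)} m_i^{h,(2)} - \Big[ \sum_{i=1}^{r} \beta_i^f Q_i^f + \sum_{i=1}^{r} \ell_i^f m_i^f + \sum_{i=1}^{3c} \alpha_i^f m_i^{f,(1)} m_i^{f,(2)} + \sum_{i=1}^{t_1} \beta_i Q_i + \sum_{i=1}^{t_1} \ell'_i m_i + \sum_{i=1}^{t_2} x_{2r+i} m'_i + m \Big], \qquad (10)$$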



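where the $m$-s are linear functions and the $\alpha$-s and $\beta$-s are field elements. Rearranging terms
we conclude that

$$\mathrm{rank}_2\Big( Q_1^g - \sum_{i=1}^{r} \beta_i^f Q_i^f - \sum_{i=1}^{t_1} \beta_i Q_i - \sum_{i=1}^{t_2} x_{2r+i} m'_i \Big) \le 3r + 9c + t_1 = c' - 2c. \qquad (11)$$

This implies that

$$\mathrm{rank}_2\Big( \Big[ Q_1^g - \sum_{i=1}^{r} \beta_i^f Q_i^f \Big]_A \Big) \le c' - 2c.$$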



We now have two cases to consider. If $(\beta_1^f, \ldots, \beta_r^f)$ are not all zero then, by arguments
described above, this implies that $\mathrm{rank}_3^{c'-c}([f]_{A'}) \le r - 1$. Recalling that $[h+g]_{A'} = [f]_{A'}$
we get a contradiction. If, on the other hand, $(\beta_1^f, \ldots, \beta_r^f) = 0$ then Equation (11) implies
that $\mathrm{rank}_2([Q_1^g]_A) \le c' - 2c$ and so $\mathrm{rank}_3^{c'-c}([g]_A) \le r - 1$, in contradiction to the choice of $g$.
Concluding, we have that for every $f \in M$, $\mathrm{rank}_3^{c'}([f]_{A'}) \le r - 1$ as required. □



By applying Lemma 5.4 $r$ times we obtain the following corollary.

Corollary 5.5. Let $M$ be a vector space of cubic polynomials satisfying $\mathrm{rank}_3(f) \le r$ for
every $f \in M$. Then there exists a set of quadratic and linear functions $A = \{Q_i\}_{i=1}^{r} \cup \{\ell_i\}_{i=1}^{r^2}$,
such that for $c = \exp(r)$, $\mathrm{rank}_3^c([f]_A) = 0$ for every $f \in M$.

We now have that every function in $M$, modulo some set $A$ of linear and quadratic
functions, can be expressed as $\sum_{i=1}^{c} \ell_i^{(1)} \ell_i^{(2)} \ell_i^{(3)}$, for some $c$. Next we show that we can add
$3c$ additional linear functions to $A$ such that modulo the new set every function becomes
zero. We again give an iterative procedure for finding those linear functions.

Before proving this result we define the notion of $\dim_3^c([f]_A)$ that will serve as a potential
function in our argument (in a similar way to the role played by $\mathrm{rank}_3^c$).

Definition 5.6. Let $A$ be a set of quadratic and linear functions and $[f]_A$ a class of cubic
functions such that $\mathrm{rank}_3^c([f]_A) = 0$. We define the dimension of the class as follows:

$$\dim_3^c([f]_A) = \min\Big\{ \dim\big(\mathrm{span}\{\ell_i^{(1)}, \ell_i^{(2)}, \ell_i^{(3)}\}_{i=1}^{c}\big) \;\Big|\; \sum_{i=1}^{c} \ell_i^{(1)} \ell_i^{(2)} \ell_i^{(3)} \in [f]_A \Big\}.$$

To better understand the reason for the definition we note that if $\mathrm{rank}_3^c([f]_A) = 0$ then
$\sum_{i=1}^{c} \ell_i^{(1)} \ell_i^{(2)} \ell_i^{(3)} + Q \in [f]_A$ for some linear functions and a quadratic $Q$. Thus, our goal will
be to find a small set of linear functions that, simultaneously, form a basis to all those linear
functions for all $f \in M$. The next lemma shows that by joining $\{\ell_i^{(1)}, \ell_i^{(2)}, \ell_i^{(3)}\}_{i=1}^{c}$ from some
polynomial $f$, of maximal dimension in $[M]_A$, to $A$, the dimension of every other element in
$[M]_A$ decreases.

Lemma 5.7. Let $A = \{Q_i\}_{i=1}^{t_1} \cup \{\ell_i\}_{i=1}^{t_2}$ be a set of linear and quadratic functions. Assume
that the rank of any nontrivial linear combination of $\{Q_i\}_{i=1}^{t_1}$ is greater than $9c + t_1 + t_2$. Let
$M$ be a linear space of cubic polynomials such that for every $[f]_A \in [M]_A$, $\mathrm{rank}_3^c([f]_A) = 0$
and $\dim_3^c([f]_A) \le d$. Then, there are $d$ linear functions $\{\ell'_i\}_{i=1}^{d}$ such that for $A' = A \cup \{\ell'_i\}_{i=1}^{d}$,
$\dim_3^c([f]_{A'}) \le d - 1$ for all $[f]_{A'} \in [M]_{A'}$.

The proof is very similar in nature to the proof of Lemma 5.4.

Proof. We start by passing to the subspace $V = \{x \mid \ell_1(x) = \ldots = \ell_{t_2}(x) = 0\}$. When
restricting the $Q_i$-s to $V$ the rank of every nontrivial linear combination can drop by at most $t_2$, so it is
still greater than $9c + t_1$. From now on we shall work over $V$. Note that if we prove the lemma
over $V$ then it clearly holds over $\mathbb{F}^n$ as well.

Let $[g]_A \in [M]_A$ be a class satisfying $\dim_3^c([g]_A) = d$. By definition we can assume
that $g$ is such that $g = \sum_{i=1}^{c} \ell_i^{g,(1)} \ell_i^{g,(2)} \ell_i^{g,(3)}$, and that for some $d$ linearly independent linear
functions $\{\ell_i^g\}_{i=1}^{d}$ it holds that $\{\ell_i^{g,(1)}, \ell_i^{g,(2)}, \ell_i^{g,(3)}\}_{i=1}^{c} \subseteq \mathrm{span}\{\ell_i^g\}_{i=1}^{d}$. Set $A' = A \cup \{\ell_i^g\}_{i=1}^{d}$.
We will show that for every $f \in M$ it holds that $\dim_3^c([f]_{A'}) \le d - 1$.

Assume for contradiction that there is some $[h]_{A'} \in [M]_{A'}$ such that $\dim_3^c([h]_{A'}) = d$.
Clearly, $\dim_3^c([h]_A) = d$ as well. W.l.o.g. let $h = \sum_{i=1}^{c} \ell_i^{h,(1)} \ell_i^{h,(2)} \ell_i^{h,(3)}$. We also denote with
$\{\ell_i^h\}_{i=1}^{d}$ a basis for $\mathrm{span}\{\ell_i^{h,(1)}, \ell_i^{h,(2)}, \ell_i^{h,(3)}\}_{i=1}^{c}$. As $\dim_3^c([h]_A)$ does not decrease modulo $\{\ell_i^g\}_{i=1}^{d}$,
it follows that $\{\ell_i^g\}_{i=1}^{d} \cup \{\ell_i^h\}_{i=1}^{d}$ are linearly independent. By definition of $A'$ we have that
$\dim_3^c([g+h]_{A'}) = \dim_3^c([h]_{A'}) = d$. Let $f \in [g+h]_A$ be such that $f = \sum_{i=1}^{c} \ell_i^{f,(1)} \ell_i^{f,(2)} \ell_i^{f,(3)}$
and $\dim(\mathrm{span}\{\ell_i^{f,(j)}\}) = d$. Since $g + h - f \in [0]_A$ we have that $g + h - f = \sum_{i=1}^{t_1} \ell'_i Q_i + Q'$.
We now show that all the $\ell'_i$-s are zero. Assume for contradiction that this is not the case.
Namely, $\{\ell'_i\}_{i=1}^{t_1}$ are not all zero. In particular, some $\ell'_i$ depends on some variable $x$. Write
$g + h - f = xF + H$ where $H$ does not depend on $x$. We now estimate $\mathrm{rank}_2(F)$. On the one
hand $F$ can be expressed as $\sum_{i=1}^{t_1} \alpha_i Q_i + \sum_{i=1}^{t_1} m_i \ell'_i + m_0$ for some coefficients $\{\alpha_i\}_{i=1}^{t_1}$ (not
all of them zero) and some linear functions $\{m_i\}_{i=0}^{t_1}$. Hence, $\mathrm{rank}_2(F)$ is larger than $9c$
(remember that $\mathrm{rank}_2(\sum_{i=1}^{t_1} \alpha_i Q_i) > 9c + t_1$ on $V$). On the other hand, $g + h - f$ is equal to

$$g + h - f = \sum_{i=1}^{c} \ell_i^{g,(1)} \ell_i^{g,(2)} \ell_i^{g,(3)} + \sum_{i=1}^{c} \ell_i^{h,(1)} \ell_i^{h,(2)} \ell_i^{h,(3)} - \sum_{i=1}^{c} \ell_i^{f,(1)} \ell_i^{f,(2)} \ell_i^{f,(3)},$$

so $F$ can be expressed as $\sum_{i=1}^{9c} m_i m'_i + m$, i.e. its rank is at most $9c$, in contradiction. It
follows that $g + h - f = Q$, for some quadratic $Q$. Thus,

$$\sum_{i=1}^{c} \ell_i^{g,(1)} \ell_i^{g,(2)} \ell_i^{g,(3)} + \sum_{i=1}^{c} \ell_i^{h,(1)} \ell_i^{h,(2)} \ell_i^{h,(3)} = \sum_{i=1}^{c} \ell_i^{f,(1)} \ell_i^{f,(2)} \ell_i^{f,(3)} + Q. \qquad (12)$$



For simplicity, assume w.l.o.g. that for $i = 1, \ldots, d$, $\ell_i^g = y_i$ and $\ell_i^h = z_i$. We would like to
show that if Equation (12) holds then $\deg(h) \le 2$, in contradiction to the choice of $h$.
To further simplify notations we assume w.l.o.g. that the $\ell_i^f$-s are linear functions in the
variables $y_1, \ldots, y_d, z_1, \ldots, z_d$ (as we can set all other variables to zero and still obtain a
similar equality). In particular, every $\ell_i^f$ can be expressed as $\ell_i^f = \ell_i^{f,g}(y) + \ell_i^{f,h}(z)$. Hence,
Equation (12) can be rewritten as $Q(y,z) + f(y,z) = g(y) + h(z)$. Therefore, it holds
that $g(y) = f(y,0) + Q(y,0)$ and $h(z) = f(0,z) + Q(0,z)$ (see footnote 11). In particular, there is some
representation of $g$ and $h$ as sums of products of linear functions such that $\{\ell_i^{f,g}(y)\}$ and
$\{\ell_i^{f,h}(z)\}$ are their basis, respectively. By applying an invertible linear transformation we
can further assume that $\ell_i^{f,g}(y) = y_i$ and $\ell_i^{f,h}(z) = z_i$. Thus, the basis for $f$ is
$\ell_1^f = y_1 + z_1, \ldots, \ell_d^f = y_d + z_d$. As a consequence we have that
$f = \sum_{i=1}^{c} \ell_i^{f,(1)}(y+z)\, \ell_i^{f,(2)}(y+z)\, \ell_i^{f,(3)}(y+z)$.

Define $F : \mathbb{F}^d \to \mathbb{F}$ as $F(u) = \sum_{i=1}^{c} \ell_i^{f,(1)}(u)\, \ell_i^{f,(2)}(u)\, \ell_i^{f,(3)}(u)$. Hence, $f = F(y+z)$,
$g = f(y,0) + Q(y,0) = F(y) + Q'(y)$ and $h = F(z) + Q''(z)$. Thus, for every $\alpha, \beta \in \mathbb{F}^d$,
$F(\alpha + \beta) = F(\alpha) + F(\beta) + Q(\alpha, \beta)$. It is not difficult to check that if $F$ is a polynomial such
that $\deg(F(\alpha + \beta) - F(\alpha) - F(\beta)) \le 2$ then $\deg(F) \le 2$. Therefore, $[h]_A = [F(z)]_A = [0]_A$
(because $F$ is quadratic), contrary to the fact that $\dim_3^c([h]_A) = d$. We thus deduce that
for every $[h]_{A'} \in [M]_{A'}$, $\dim_3^c([h]_{A'}) < d$ as required. □
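
The claim that $\deg(F(\alpha+\beta) - F(\alpha) - F(\beta)) \le 2$ forces $\deg(F) \le 2$ can be illustrated on individual monomials. The sketch below is not a proof: it assumes a hypothetical prime $p = 5$ (so that all individual degrees stay below $p$), stores polynomials as dictionaries from exponent tuples to coefficients, and uses helper names of our own choosing. It expands $F(a+b) - F(a) - F(b)$ symbolically and reports its total degree.

```python
P = 5  # illustrative prime; individual degrees used below stay smaller than P

def poly_add(f, g, sign=1):
    """Return f + sign*g for polynomials stored as {exponent tuple: coeff}."""
    out = dict(f)
    for e, c in g.items():
        out[e] = (out.get(e, 0) + sign * c) % P
    return {e: c for e, c in out.items() if c}

def poly_mul(f, g):
    """Product of two polynomials with coefficients reduced mod P."""
    out = {}
    for ef, cf in f.items():
        for eg, cg in g.items():
            e = tuple(i + j for i, j in zip(ef, eg))
            out[e] = (out.get(e, 0) + cf * cg) % P
    return {e: c for e, c in out.items() if c}

def substitute(F, inputs, nvars):
    """Plug the polynomials `inputs` (in nvars variables) into the variables of F."""
    total = {}
    for exps, coeff in F.items():
        term = {(0,) * nvars: coeff % P}
        for inp, e in zip(inputs, exps):
            for _ in range(e):
                term = poly_mul(term, inp)
        total = poly_add(total, term)
    return total

def degree(f):
    """Total degree; the zero polynomial gets degree -1."""
    return max((sum(e) for e in f), default=-1)

def additive_defect(F, d):
    """F(a + b) - F(a) - F(b) as a polynomial in a_1..a_d, b_1..b_d."""
    def var(i):
        return {tuple(int(j == i) for j in range(2 * d)): 1}
    a = [var(i) for i in range(d)]
    b = [var(d + i) for i in range(d)]
    a_plus_b = [poly_add(x, y) for x, y in zip(a, b)]
    G = substitute(F, a_plus_b, 2 * d)
    G = poly_add(G, substitute(F, a, 2 * d), sign=-1)
    return poly_add(G, substitute(F, b, 2 * d), sign=-1)

# Cubic monomials x1*x2*x3, x1^2*x2, x1^3 and the quadratic monomial x1*x2.
for F, d in [({(1, 1, 1): 1}, 3), ({(2, 1): 1}, 2), ({(3,): 1}, 1), ({(1, 1): 1}, 2)]:
    print(F, degree(additive_defect(F, d)))
```

For each cubic monomial the defect still has degree three, while for the quadratic monomial it has degree two, which is the behaviour the argument relies on.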

Combining Lemma 5.4 and Lemma 5.7 we are now able to prove Lemma 5.1.



11 We can assume w.l.o.g. that $\ell_i^{f,g}$ and $\ell_i^{f,h}$ do not have a constant term.

Proof of Lemma 5.1. Corollary 5.5 implies that there exists a set of quadratic and linear
functions $A = \{Q_i\}_{i=1}^{r} \cup \{\ell_i\}_{i=1}^{r^2}$, such that for $c = \exp(r)$, $\mathrm{rank}_3^c([f]_A) = 0$ for every
$f \in M$. By Lemma 3.8 we can assume w.l.o.g. that every nontrivial linear combination of
the $Q_i$'s has rank larger than $10c$ (possibly after passing to a subspace $V$ of dimension at
least $n - \mathrm{poly}(c) = n - \exp(r)$ and throwing away some of the $Q_i$-s, without changing the property
of $[M]_A$). By applying Lemma 5.7 $d = 3c$ times we get a set $A' = \{Q_i\}_{i=1}^{t_1} \cup \{\ell_i\}_{i=1}^{t_2}$, for
$t_1 \le r$ and $t_2 = \exp(r)$, such that $\dim_3^c([f]_{A'}) = 0$ for every $[f]_{A'} \in [M]_{A'}$. In particular,
every $f \in M$ can be represented as $f = \sum_{i=1}^{t_1} \ell_i^f Q_i + \sum_{i=1}^{t_2} \ell_i Q_i^f + Q_0^f$ for some linear and
quadratic functions $\{\ell_i^f\} \cup \{Q_i^f\}$ depending on $f$. □

5.3 Completing the proof

We can now complete the proof of Theorem 4. We first give a lemma summarizing what we
have achieved so far.

Lemma 5.8. Let $f$ be a degree four polynomial with $\|f\|_{U^4} = \delta$. Then for $r = O(\log^2(1/\delta))$
there exist a subspace $V$, satisfying $\dim(V) \ge n - O(\log(1/\delta))$, $r$ quadratic polynomials
$Q_1, \ldots, Q_r$ and $R = \exp(r)$ linear functions $\ell_1, \ldots, \ell_R$ such that for every $y \in V$ we have
that $\Delta_y(f|_V) = \sum_{i=1}^{r} Q_i \cdot \ell_i^y + \sum_{i=1}^{R} \ell_i \cdot Q_i^y + Q_0^y$.

Proof. Let $f$ be a quartic function such that $\|f\|_{U^4} \ge \delta$. By Lemma [3T2l, Theorem 2 and
Lemma |2~o1 there is a subspace $V$, satisfying $\dim(V) \ge n - O(\log(1/\delta))$, such that every
partial derivative of $f|_V$ is a cubic polynomial of rank at most $r = O(\log^2(1/\delta))$. Let
$f' = f|_V$. Lemma 5.1 gives a set $A = \{Q_i\}_{i=1}^{r} \cup \{\ell_i\}_{i=1}^{R}$ such that every $\Delta_y(f')$ can be
written as $\Delta_y(f') = \sum_{i=1}^{r} Q_i \cdot \ell_i^y + \sum_{i=1}^{R} \ell_i \cdot Q_i^y + Q_0^y$. Notice that the lemma concerns a linear
space of cubic polynomials. In our case the linear space will be the span of all the partial
derivatives of $f'$. For every $y, z \in V$ we have $\Delta_y(f') + \Delta_z(f') - \Delta_{y+z}(f') = -\Delta_y \Delta_z(f')$, so
$\deg(\Delta_y(f') + \Delta_z(f') - \Delta_{y+z}(f')) \le 2$. Hence
we see that in order to 'close' the space we only need to add quadratic polynomials and so
the assumption about the rank of the cubic polynomials in the space does not change. □
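
The step used to 'close' the space of derivatives rests on the identity $\Delta_y f + \Delta_z f - \Delta_{y+z} f = -\Delta_y \Delta_z f$ and on the fact that a second derivative of a quartic has degree at most two. The following sketch checks both numerically on a random quartic; the field size $5$, the number of variables $3$, and all function names are illustrative assumptions, not taken from the paper.

```python
import random
from itertools import product

P, N = 5, 3  # illustrative field size and number of variables

def random_quartic(rng):
    """Random polynomial of total degree <= 4 as {exponent tuple: coeff}."""
    return {e: rng.randrange(P)
            for e in product(range(5), repeat=N) if sum(e) <= 4}

def evaluate(coeffs, x):
    """Evaluate the polynomial at the point x, working mod P."""
    total = 0
    for e, c in coeffs.items():
        term = c
        for xi, ei in zip(x, e):
            term = term * pow(xi, ei, P) % P
        total = (total + term) % P
    return total

def shift(x, y):
    return tuple((a + b) % P for a, b in zip(x, y))

def derivative(g, y):
    """Delta_y g : x -> g(x + y) - g(x)."""
    return lambda x: (g(shift(x, y)) - g(x)) % P

def check(trials=200, seed=0):
    rng = random.Random(seed)
    coeffs = random_quartic(rng)
    f = lambda x: evaluate(coeffs, x)
    vec = lambda: tuple(rng.randrange(P) for _ in range(N))
    for _ in range(trials):
        x, y, z, a, b, c = (vec() for _ in range(6))
        defect = (derivative(f, y)(x) + derivative(f, z)(x)
                  - derivative(f, shift(y, z))(x)) % P
        second = derivative(derivative(f, z), y)  # Delta_y Delta_z f
        if defect != (-second(x)) % P:
            return False
        # A function of degree <= 2 is annihilated by three further derivatives.
        if derivative(derivative(derivative(second, a), b), c)(x) != 0:
            return False
    return True

print(check())
```

The vanishing of three further derivatives is a necessary condition for degree at most two, so the check is a consistency test rather than a full degree computation.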

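Proof of Theorem 4. By Lemma 5.8 for $r = O(\log^2(1/\delta))$ there exist $r$ quadratics $Q_1, \ldots, Q_r$
and $R = \exp(r)$ linear functions $\ell_1, \ldots, \ell_R$ such that for every $y \in V$ we have that
$\Delta_y(f|_V) = \sum_{i=1}^{r} Q_i \cdot \ell_i^y + \sum_{i=1}^{R} \ell_i \cdot Q_i^y + Q_0^y$.

We now wish to express each $Q_i$ in the form of Theorem 1.1. We have two cases. Assume
first that $\mathbb{F} = \mathbb{F}_2$. Then for every $1 \le i \le r$ we have that $Q_i = \sum_{j} \ell_{i,j} \cdot \ell'_{i,j} + \ell_{i,0}$.
For $\alpha \in \mathbb{F}^R$ let $V_\alpha = \{x \in V \mid \forall 1 \le i \le R,\ \ell_i(x) = \alpha_i\}$. Clearly $\dim(V_\alpha) \ge \dim(V) - R$.
Let $f_\alpha = f|_{V_\alpha}$. Then for every $y \in V_\alpha$, $\Delta_y(f_\alpha) = \sum_{i=1}^{r} Q_i|_{V_\alpha} \cdot \ell_i^y + {Q'}_0^{y}$.
We now repeat the following process for each $1 \le i \le r$. Assume that we are
working over a subspace $V_{\alpha,\beta^1,\ldots,\beta^{i-1}}$, of dimension $d_{i-1} = \dim(V_{\alpha,\beta^1,\ldots,\beta^{i-1}})$. Consider
$Q_i|_{V_{\alpha,\beta^1,\ldots,\beta^{i-1}}}$. By Theorem 1.1 we can write $Q_i|_{V_{\alpha,\beta^1,\ldots,\beta^{i-1}}} = \sum_{j=1}^{d_{i-1}/2} \ell_{i,j} \cdot \ell'_{i,j} + \ell_{i,0}$. For
$\beta^i \in \mathbb{F}^{d_{i-1}/2}$ define $V_{\alpha,\beta^1,\ldots,\beta^i} = \{x \in V_{\alpha,\beta^1,\ldots,\beta^{i-1}} \mid \forall 1 \le j \le d_{i-1}/2,\ \ell_{i,j}(x) = (\beta^i)_j\}$. Note
that $\bigcup_{\beta^i \in \mathbb{F}^{d_{i-1}/2}} V_{\alpha,\beta^1,\ldots,\beta^i} = V_{\alpha,\beta^1,\ldots,\beta^{i-1}}$. Thus, the set $\{V_{\alpha,\beta^1,\ldots,\beta^r}\}$ forms a partition of
$V$. Moreover, observe that for every $\alpha, \beta^1, \ldots, \beta^i$, $\deg(Q_i|_{V_{\alpha,\beta^1,\ldots,\beta^i}}) \le 1$. Thus, for
every $\alpha, \beta^1, \ldots, \beta^r$, all the partial derivatives of $f|_{V_{\alpha,\beta^1,\ldots,\beta^r}}$ are of degree at most two and so
$\deg(f|_{V_{\alpha,\beta^1,\ldots,\beta^r}}) \le 3$ as claimed. To finish the proof we note that
$\dim(V_{\alpha,\beta^1,\ldots,\beta^i}) \ge \dim(V_{\alpha,\beta^1,\ldots,\beta^{i-1}})/2$. Therefore, $\dim(V_{\alpha,\beta^1,\ldots,\beta^r}) \ge (n - R)/2^r = n/\exp(\log^2(1/\delta))$.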
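
The partition step over $\mathbb{F}_2$ can be seen on a toy example. The sketch below uses a hypothetical quadratic $Q = x_1 x_2 + x_3 x_4 + x_5$ (our choice, written as a sum of products of linear functions plus a linear term), fixes one factor of each product, and checks that the resulting cosets partition the space and that $Q$ has degree at most one on each of them.

```python
from itertools import product

def Q(x):
    """Hypothetical quadratic over F_2 of the form l1*l1' + l2*l2' + l0."""
    x1, x2, x3, x4, x5 = x
    return (x1 * x2 + x3 * x4 + x5) % 2

def is_affine_on(points, g):
    """Over F_2, g has degree <= 1 on an affine subspace iff
    g(a) + g(b) + g(c) + g(a+b+c) = 0 for all a, b, c in the subspace."""
    for a, b, c in product(points, repeat=3):
        s = tuple((i + j + k) % 2 for i, j, k in zip(a, b, c))
        if (g(a) + g(b) + g(c) + g(s)) % 2:
            return False
    return True

space = list(product((0, 1), repeat=5))
# Fix one factor of each product: x1 = beta[0] and x3 = beta[1].
cosets = {beta: [x for x in space if (x[0], x[2]) == beta]
          for beta in product((0, 1), repeat=2)}

print(sorted(len(pts) for pts in cosets.values()))
print(all(is_affine_on(pts, Q) for pts in cosets.values()))
print(is_affine_on(space, Q))
```

Fixing two linear functions costs two dimensions here, matching the count in the proof where fixing $d_{i-1}/2$ functions leaves at least half of the dimension.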
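When $\mathrm{char}(\mathbb{F}) = p > 2$ we have the representation $Q_i|_{V_{\alpha,\beta^1,\ldots,\beta^{i-1}}} = \sum_{j} \ell_j^2 + \ell_0$.
Rewriting we obtain

$$Q_i|_{V_{\alpha,\beta^1,\ldots,\beta^{i-1}}} = \sum_{k=1}^{d_{i-1}/p} \sum_{j=1}^{p-1} (\ell_{pk+j} - \ell_{pk})(\ell_{pk+j} + \ell_{pk}) + \ell_0,$$

where the squares are grouped into $d_{i-1}/p$ blocks $\ell_{pk}, \ldots, \ell_{pk+p-1}$ of size $p$ and we use $p \cdot \ell_{pk}^2 = 0$.

Observe that after fixing, for every block $k$ and every $1 \le j \le p - 1$, the value of $\ell_{pk+j} - \ell_{pk}$
according to $\beta^i$, $Q_i|_{V_{\alpha,\beta^1,\ldots,\beta^i}}$ becomes linear.
Thus, the same argument as before gives the required result here as well. □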
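
One way to see why fixing the differences $\ell_{pi+j} - \ell_{pi}$ linearizes a sum of squares in characteristic $p$ is the identity $\sum_{j=0}^{p-1} v_j^2 = \sum_{j=1}^{p-1} (v_j - v_0)(v_j + v_0)$ over $\mathbb{F}_p$, which holds because $p \cdot v_0^2 = 0$. Reading the rewriting step this way is an assumption of this sketch rather than a statement taken from the text; the function name is ours.

```python
from itertools import product

def block_identity_holds(p):
    """Check sum_{j=0}^{p-1} v_j^2 == sum_{j=1}^{p-1} (v_j - v_0)(v_j + v_0) mod p
    for every assignment of values in F_p to v_0, ..., v_{p-1}."""
    for v in product(range(p), repeat=p):
        lhs = sum(t * t for t in v) % p
        rhs = sum((t - v[0]) * (t + v[0]) for t in v[1:]) % p
        if lhs != rhs:
            return False
    return True

print(block_identity_holds(3), block_identity_holds(5))
```

Once each difference $v_j - v_0$ is fixed to a constant $b_j$, every factor $v_j + v_0 = 2 v_0 + b_j$ is linear in $v_0$, so the whole block becomes a linear function.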



Combining the idea of the above proof with the notion of disjoint polynomials we prove
Theorem 5.

Proof Sketch of Theorem 5. As in the proof of Theorem 4 we obtain linear $\{\ell_i\}_{i=1,\ldots,R}$ and
quadratic $\{q_i\}_{i=1,\ldots,r}$, where $r = O(\log^2(1/\delta))$ and $R = \exp(r)$, that form a 'basis' to the set
of partial derivatives. By passing to a subspace of codimension $R$ and using Lemma 4.14 we
can assume w.l.o.g. that the $q_i$-s are disjoint and that every partial derivative has the form
$\Delta_y(f) = \sum_{i=1}^{r} q_i \cdot \ell_i^y + q_0^y$. As $\mathrm{char}(\mathbb{F}) > 4$ we can assume w.l.o.g. that $q_i = x_i^2 + q'_i$ and
that $x_i$ can appear in $q'_i$ only as a linear term. We now subtract from $f$ terms of the form
$\alpha q_i q_j$ such that in the resulting polynomial $f'$ there will be no monomial of the form $x_i^2 x_j^2$ for
$i \le j$. Note that $f'$ also has the property that for every $y$, $\Delta_y(f') = \sum_{i=1}^{r} q_i \cdot \ell_i^y + q_0^y$. We
now show that degree four monomials in $f'$ may only contain $x_i$ or $x_i^3$ but not $x_i^2$, for $i \in [r]$.
Indeed, assume to the contrary that $x_i^2$ appears in a degree four monomial. Then, $x_i$ appears
in $\Delta_{x_i}(f')$ in a degree three monomial. This monomial comes from some $\ell_j^{x_i} q_j$ for $j \ne i$.
Therefore, we also have the term $x_i x_j^2$ in $\Delta_{x_i}(f')$ (it is not difficult to see that this term cannot
be cancelled by any other $\ell_k^{x_i} q_k$). As $\mathrm{char}(\mathbb{F}) > 4$, integration w.r.t. $x_i$ gives that the term
$x_i^2 x_j^2$ appears in $f'$, in contradiction. We can thus write $f' = \sum_{i=1}^{r} x_i^3 \ell_i + f''$, where in $f''$ each
$x_i$ has degree at most one. Consider any $y$ 'orthogonal' to $\{x_1, \ldots, x_r, \ell_1, \ldots, \ell_r\}$ (namely,
substituting $y$ in any of those linear functions gives zero). Then for each $i$, $\Delta_y(x_i^3 \ell_i) = 0$.
Hence, $x_i$ is the highest power of $x_i$ appearing in $\Delta_y(f')$. As the $q_i$-s are disjoint and
$\Delta_y(f') = \sum_{i=1}^{r} q_i \cdot \ell_i^y + q_0^y$, we obtain that it must be the case that $\deg(\Delta_y(f')) \le 2$. Thus,
$f'$ can be rewritten as a polynomial in at most $2r$ variables plus a degree three polynomial.
Therefore, possibly after a change of basis we can write $f = \sum_{i \le j} \alpha_{i,j}\, q_i \cdot q_j + \sum_{i} \ell_i \cdot g_i + g_0$
as needed. □

6 Conclusions 

In this paper we gave strong structural results for degree three and four polynomials that
have a high bias. It is a very interesting question whether such a structure exists for higher
degree biased polynomials. Green and Tao [GT07] proved such a result when $\deg(f) < |\mathbb{F}|$
(with much worse parameters for degrees three and four), so this question is mainly open
for small fields. Another interesting question is improving the parameters in the results
of [GT07, KL08]. There it was shown that when $\deg(f) = d$ and $f$ is biased then
$f = F(g_1, \ldots, g_{c_d})$, where $\deg(g_i) < \deg(f)$. However, the dependence of $c_d$ on the degree $d$ and
the bias $\delta$ is terrible. Basically, $c_3 = \exp(\mathrm{poly}(1/\delta))$ and $c_d$ is a tower of height $c_{d-1}$. In
contrast, our results give that $c_3 = \log^2(1/\delta)$ and $c_4 = \mathrm{poly}(1/\delta)$. Thus, it is an intriguing
question to find the true dependence of $c_d$ on $\delta$. In particular, as far as we know, it may
be the case that $c_d$ is polynomial in $1/\delta$ (where the exponent may depend on $d$), or even
$\mathrm{poly}(\log(1/\delta))$.

For the case of degree four polynomials with high $U^4$ norm we proved an inverse theorem
showing that on many subspaces, of dimension $\Omega(n)$, $f$ equals a degree three polynomial
(a different polynomial for each subspace). Such a result seems unlikely to be true for higher
degrees. However, it may be the case that if $\deg(f) = d$ and $f$ has a high $U^d$ norm then $f$
is correlated with a lower degree polynomial on a high dimensional subspace.